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The Hopping Hoop Revisited 


Timothy Pritchett 


In this MONTHLY, T. Tokieda [1] recently provided a novel discussion of the 
hopping hoop problem described by Littlewood [2]. Unfortunately, the analysis in 
[1] is incorrect. We show that if the “no slipping” condition imposed in [1] and [2] 
is strictly adhered to, the hoop never becomes airborne. Essential for the observed 
hop of the hoop is a phase in which there is slippage at the point of contact of the 
hoop with the supporting surface. In order to expose this and other subtleties 
not considered in [1], we treat a somewhat more general problem. The view of 
Littlewood’s hopping hoop problem presented here is consciously different from 
that given in [1]. In offering this alternative way of thinking about Littlewood’s 
hopping hoop, we hope to deepen the reader’s intuitive understanding of precisely 
why the hoop hops. 

We consider here a rigid circular hoop of radius R and mass (1 — A)M that, at 
least initially, rolls without slipping along a flat surface. An additional object of 
mass AM is rigidly attached to the rim of the hoop, with the point of attachment 
coinciding with the centroid of the object. Here, M is the total mass of the 
combined system consisting of the hoop and the attached object. The parameter A 
is the ratio of the object mass to the total; in the limit A — 1, the problem reduces 
to the case of the massless hoop considered by Tokieda and Littlewood. The 
center of mass of the combined system, shown as a dot in Figure 1, lies a distance 


center of mass 


center of hoop 
(Xe: Yc) 


x 


Figure 1. The hopping hoop: a circular hoop of mass (1 — A)M with an attached object of mass AM. 


AR from the center of the hoop and is situated on the line segment connecting the 
center of the hoop with the attached object. Let @(¢) be the angle which this line 
segment makes with the vertical and letx(t) and y(t) be, respectively, the horizon- 
tal and vertical coordinates of the center of mass of the system. 

The three coordinates x(t), y(t), and @(t) are not independent, but are subject 
to various constraints: 


1. Since the hoop is assumed to be a rigid object that suffers no deformation as 
it rolls, it is always true that (x(t) — x,(t))* + (y(t) — y(t)? = A’R’, or 
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equivalently, 
x(t) —x.(t) = RAsin 6(t) (1) 
y(t) —y,(t) = Racos 6() (2) 
where x,(t) and y.(t) denote, respectively, the horizontal and vertical coordi- 
nates of the center of the hoop. 

2. Since the supporting surface is also assumed to be rigid, the vertical height of 
the center of the hoop must satisfy y.(t) = R so long as the hoop and the 
surface are in contact. 

3. As long as there is no slipping, the horizontal position of the center of the 
hoop and the angular coordinate @(t) are related by x,(t) = R@(t), where we 
have tacitly set x.(0) = Re) = 0. 


Taken together, these three constraints imply that as long as the hoop rolls without 
slipping along the supporting surface, the center of mass of the system moves along 
a curtate (shortened) cycloid, also known as a trochoid. 

Every constraint is maintained by a corresponding force. In particular, con- 
straints 2 and 3 result from the force of contact between the hoop and the 
_ supporting surface: the condition y.(t) = R is maintained by the so-called normal 
force, i.e., the component of the contact force acting perpendicular to the surface, 
while the “no slipping” constraint, x.(t) = R@(t), is maintained by friction, i.e., by 
the component of the contact force acting parallel to the surface. Constraint 1, 
the “rigid hoop” constraint, is maintained by forces within the hoop itself; 
because these are internal to the system under consideration, they need not 
concern us here. 

All forces external to the system, including those responsible for maintaining 
the constraints just enumerated, appear in the equations of motion of the center of 
mass of the system, which are obtained from Newton’s Second Law: 

X(t) = f(t) (3) 

y(t) =n(t) —8 (4) 
Here, g, n(t), and f(t) represent, respectively, the force per unit mass (accelera- 
tion) due to gravity, to the normal component of the contact force, and to friction. 
In order to simplify the equations, we divide all forces by M, the total mass of the 
system. For brevity we continue to refer to the resulting quantities as “forces” even 
though they are actually accelerations, i.e., forces per unit mass. When the hoop 
loses contact with the supporting surface, n(t) = f(t) = 0, and the equations (3) 
and (4) describe a parabolic trajectory. 

The most general motion of a rigid body consists of a translation of the center 
of mass, combined with a rotation about an axis containing the center of mass. In 
the present case, this rotation is described by the angular variable 6(t), whose time 
evolution is governed by the following equation, obtained by considering the 
torques about the axis perpendicular to the plane of Figure 1 and passing through 
the center of mass of the system: 


ee 
77 C4) = MOL) ~ 2-()] — FO (1) (5) 


In this expression, J is the moment of inertia of the system (hoop + object) about 
the axis through the system center of mass. It is given by J = MR*{1 — A* + dé}, 
where the last term in brackets requires some comment. Unless the attached 
object is a point mass (a mathematical idealization), it has a non-vanishing moment 
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of inertia about any axis through its centroid. That “internal” moment of inertia is 
proportional to the mass AM of the attached object and to the square of some 
length parameter related to the dimensions of the object in the plane perpendicu- 
lar to the axis of rotation; the moment of inertia is of the form AMb?’. For 
example, for a cylinder of uniform mass density (a battery!), attached to the hoop 
so that its longitudinal axis is aligned with the hoop, b = L/Vy12, where L is the 
length of the cylinder; for the same cylinder, attached to the hoop in such a way 
that its longitudinal axis is perpendicular to the plane of the hoop, b = r/v12, 
where r is the cylinder radius; and so on. For a point mass, b = 0. Instead of b, 
however, we choose to work with the dimensionless parameter € = (b/R)*. Since 
€ <1 in almost all cases, one might wonder why we make a large fuss over a 
quantity that, in practice, is negligible small. The answer is that we wish to examine 
the limit A — 1, in which the mass of the hoop is negligible relative to that of the 
attached object. In this limit, the center of mass of the system coincides with the 
centroid of the object, and the system’s angular momentum relative to its center of 
mass resides entirely in the attached object spinning about its centroid; this 
angular momentum is zero if the corresponding moment of inertia vanishes, as is 
the case if we set to zero both the mass of the hoop and the “internal” rotational 
inertia of the attached object. Alternately, we observe that (5) makes sense only so 
long as J # 0, and, in the case of a massless hoop, J is nonvanishing only if we 
account for the rotational inertia of the attached object about its own centroid. 
The importance of (5) becomes clearer when we relax the constraint that the hoop 
rolls entirely without slipping. 

Returning to (5), we make the following important observation: When hoop and 
supporting surface part company, the contact forces n(t) and f(t) vanish and (5) 
reduces to 6(t) = 0. Thus, the hoop simply rotates about its center of mass at a 
constant angular speed equal to its angular speed at the instant the contact forces 
went to zero. At the same time, equations (3) and (4), which describe the 
translational motion of the center of mass, reduce to the equations for a body 
falling freely with a constant downward acceleration. This simultaneous free 
fall/free rotation continues until the height y(t) of the center of the hoop 
decreases once again to R, at which time n(t) acquires the positive value required 
to enforce constraint 2. In this way, we arrive at an alternative view of why the 
hoop hops: it is simply rotating about its center of mass, as that center of mass falls 
freely under the influence of gravity. Littkewood’s hopping hoop is reminiscent of a 
good high jumper, who is skilled at rotating and “deforming” her body in such a 
way that all of its component parts clear the bar, even as her center of mass passes 
under it! 

The motion of the system is completely determined by equations (3), (4), and 
(5), along with the relevant constraints and initial conditions. We have already 
noted that constraints 1, 2, and 3 are simultaneously satisfied only if the center of 
mass of the system moves along the trochoid: 


x(t) = R(@(t) + Asin 0(t)) (6) 
y(t) = R(1 + Acos 6(f)) (7) 


By combining equations (3) through (7) one may obtain a single second-order 
differential equation for one of the variables (6(t), say) from which the motion of 
the system may be determined. The result will be valid as long as the constraints 
~ embodied by (6) and (7) hold, i.e., as long as the hoop rolls without slipping along 
the supporting surface. Alternately, one may observe that the constraints imply 
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that x(t) = y(t)e(t) and y(t) = (x(t) — x,(t))A(t), and use these relations to 
combine (3), (4), and (5) into a single equation, from which the forces of constraint 
n(t) and f(t) are absent and which, more importantly, may be written as a total 
derivative. In this way, one arrives at the equation of energy conservation. 
Physically, the absence of the forces of constraint from the equation of energy 
conservation reflects the fact that forces of constraint do no net work because they 
act perpendicular to the configuration space of the system. If now we assume, as 
Tokeida does, that the attached mass initially moves horizontally with speed vp, 
energy conservation requires 


5M(x? + y?) + 310° + Mgy = M((1 + A)(4up + gR) + gupre), 
where the additional terms not seen in the corresponding equation in [1] arise from 


the rotational kinetic energy of the (massive) hoop and attached (extended) object. 
Adding the trochoid constraint, equations (6) and (7), we obtain 
g ((L+A +4 Ae/2)c + 2Asin’ (6(t)/2)) ; 
R 1 + Acos(OA(t)) + AEe/2 (8) 
where c = 1, /(4gR). This result may be used in conjunction with (4) to compute 
- ¥(@(t)), as in [1]. Tokieda observes that the hop must occur at the first value of 
6(t) for which the parabolic trajectory of the center of mass in free fall departs 
from above from the trochoidal trajectory imposed by the rolling hoop, ie., the 
hop occurs when j(@(t)) = —g. But, referring to (4), this is precisely when 
n(t) = 0, i.e., when the normal component of the force of contact between the 
hoop and the supporting surface vanishes, a condition that certainly seems to 
suggest that the hoop has just lost contact with the supporting surface and is about 
to become airborne. That is not the case, as we now prove. 

Combining (4), (7), and (8), we obtain the following rather formidable expres- 
sion for the normal force n as a function of € = cos 0(t). 

§ 
Ne) =) where 
ie (eA + 2EA + a? 
p(é) =(1 -caA€é) de? 
Sen((e6°a4-2ee + 2646 = 36° 1) A Ae = 1) eA 4). 9) 
SO(CE Pe Oe SH Qe PAA, 
HCE 266 te ee ee Ly (C2 )yene 2) 

We now specialize to the case of a massless hoop (A = 1) and determine the angle 
6, for which n > 0 by finding the roots of the cubic polynomial p(é) in (9). The 
result is most illuminating if we express it as a series expansion in the “inertia 
parameter” e. The physically relevant root is 

(1—c) | (ce +3) 1 
= Cr. fay 

(c + 1) 4 (c + 1) 
If the attached object is a point mass (€ = 0), we recover from (10) the result given 
in [1] for the critical angle 6, at which, it is claimed, the hoop loses contact with 
the supporting surface. If that is indeed the case, then the subsequent motion of 


the system must consist of a translation of the center of mass along the parabolic 
trajectory corresponding to free fall, 


x(t) =x, +x(t—1¢,) (11) 
y(t) =y, + y(t - 41) — ge(e— hy’, (12) 


6(t)° = 


e?] + O(e)’. (10) 


cos 06, =c 
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combined with a rotation about the center of mass at constant angular speed 06,: 


Here, (x,, y,) and (x,, y,) are the Cartesian coordinates of the position and 
velocity of the center of mass at the instant ¢, at which the normal force vanishes, 
and 6, is the corresponding angular speed of rotation. Series expansions of these 
quantities are obtained from (10), in conjunction with (6), (7), and (8). If one now 
substitutes (12) and (13) into (2), replacing y,, y,, 0,, and 6, by their expansions in 
€, one obtains the following expression for the height y,[t] of the center of the 
hoop at times ¢ > ¢,: 


 .. — (L—c)((e + 4)c* + 8c + 3€ + 4) ; ; ‘ 
R =a | eae 7 + O(e) +O(tT), (14) 


where 7 = yg/R(t — ¢t,). Now, c must be strictly less than unity, or else the 
requirement that the hoop initially rolls without slipping cannot be satisfied; for 
c>1 Ge., for vp => ~4gR), the hoop “glides” immediately without rolling, as 
Tokieda points out [1]. Since c < 1, the term of order tT” in (14) is negative. This 
means that an instant after the normal force between the hoop and the supporting 
surface goes to zero, the center of the hoop is moving not upward, but downward, 
a result that one could have obtained equally well by computing the acceleration 
y(t,)! Thus, even though at ¢, the normal component of the contact force is 
instantaneously zero, it cannot remain zero for a nonvanishing duration. This is 
because at ¢, the center of the hoop is accelerating downward, and a positive 
normal force immediately results to enforce constraint 2 and prevent the height of 
the center of the hoop from falling below R. Equation (14) shows that a massless 
hoop cannot get off the ground if we insist that there be no slipping. In particular, 
this is true for the special case considered in [1] (e = 0: attached object is a point 
mass) as well as for Littlewood’s original, formulation (€ = c = 0: attached point 
mass with zero initial velocity) [2]. In the case of a massive hoop (A < 1), the 
problem must be treated numerically, but the result is the same: as long as we 
require that the hoop roll entirely without slipping, it will never become airborne. 

Even if the forces of constraint vanish instantaneously, the hoop and the 
supporting surface can part company only if y + R@sin 6 > 0, ie., only if the 
speed R@sin 6 at which the center of the hoop rises (due to the rotation of 
the system about its center of mass) exceeds the speed |y| at which the center of 
mass falls under the influence of gravity. However, as long as we stipulate that the 
hoop rolls entirely without slipping, imposing by fiat the trochoid constraint, then 
it follows from (7) that the magnitude of the quantities R@ sin 6 and y are equal. 
Right on the money is Littlewood’s [2] comment that, in practice, the hoop slips 
first (before hopping). 

At what point does the hoop begin to slip? By combining equations (3), (6), and 
(8), one obtains the following expression for the force f responsible for maintain- 
ing the “no slipping” condition, constraint 3. 


oe SSN. 
Sey (ne + DAE + 2p the?) i 


q(é) =2€A° + (2A — 3eA + c(EA + 2A 42) —6)EA4+2A—eA (15) 
+ c(€A* + Zed? + 3€A + 2A + 2) — 4A%* - 2 
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We remind the reader that (9) and (15) represent, respectively, the vertical and 
horizontal components of the contact force necessary at any given angle 6 = 
arccos (€) to keep the center of mass moving on the trochoidal trajectory dictated 
by constraints 1, 2, and 3. It is natural to ask whether the required force is indeed 
physically available. 

The normal force v is limited only by the resistance to tensile stress of the hoop 
and the supporting surface; in assuming that both the hoop and the surface are 
infinitely rigid, not subject to deformations of any kind, we allow n to be as large 
as is necessary to maintain constraint 2. In the real world, of course, no object is 
infinitely rigid. In the limit of small strains, however, the resulting deformations 
are elastic—energy-conserving—and the potential energy stored as elastic strain in 
the hoop and/or supporting surface just prior to the loss of contact provides an 
additional source of propulsion for the hop. 

The frictional force f, on the other hand, has a maximum value, and if this is 
exceeded, slipping occurs. It is known experimentally that the maximum possible 
magnitude of the force of static friction existing at the point of contact of two 
surfaces is proportional to the magnitude of the normal force of contact exerted by 
one surface on the other: 


Ifl <|fmax | = “17 (16) 


The constant of proportionality, ,, is known as the coefficient of static friction and 
it depends on the nature of the surfaces that are in contact. For most materials, pu, 
ranges from around 0.01 to 1.5 [3]. Once the point of contact between the hoop 
and the supporting surface begins to slip, the magnitude of the friction force is 
proportional to the magnitude of the normal force: |f(t)| = ,|n(t)|. The relevant 
constant of proportionality, the coefficient of kinetic friction ,, is always less than 
the coefficient of static friction w,, for any pair of surfaces in contact. This is 
because the “‘cold-welding” that serves to bind the surfaces together in the static 
case cannot occur when there is relative motion between the two surfaces [3]. Once 
slipping begins, the friction force is directed so as to oppose the relative motion of 
the two surfaces involved. For the skidding hoop, 


f(t) = pyn(t) sgn (R6(t) — £,(t)), (17) 


where sgn(x) is the sign function and n(t) > 0. Specifically, f(t) is positive 
(directed to the right) if the hoop is skidding forward, i.e., if the motion of the 
hoop relative to the point of contact is directed to the left, and f(t) is negative 
(directed to the left) if the hoop is skidding backward, i.e., if the motion of the 
hoop relative to the point of contact is directed to the right. 

The hoop begins to slip at the minimum angle for which equality holds in (16), 
i.e., for the minimum (real) root of {f(cos @)}* — {u,n(cos @)}* = 0, where the 
forces of constraint n and f are given by (9) and (15), respectively. For example, in 
the case of a massless hoop (A = 1) with attached point mass (e€ = 0), 4; is the 
lesser of arccos(c) and 2arctan(p,). The subsequent motion of the system is 
computed using (17) and numerically solving the coupled differential equations (3) 
and (5) subject to constraints 1 and 2. The former constraint (no deformations) 
translates simply into the twin conditions expressed by equations (1) and (2). 
Considerably more tedious to implement is the latter constraint, which must be 
enforced explicitly at each time step in the integration. One proceeds as follows: A 
provisional value of the normal force n(t) is obtained from n(t — At) by extrapola- 
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tion. From n(t), x(t), y(t), a(t), x(t), y(t), and @(t) one computes a provisional 
value of y.(¢ + At). If y(t + At) < R, one adjusts n(t) to give y(t + At) =R. 
The revised n is then used to compute the values of x, y, 0, x, y, and 6 at time 
t+ At. The procedure continues until y(t) is strictly greater than R, at which 
point the hoop is airborne and the subsequent motion of the system is given by 
(11)-(13), where the subscript “1” now refers to values at the final step in the 
numerical procedure, i.e., at the instant contact between hoop and supporting 
surface is lost. 

The implementation of the procedure just described is complicated by the fact 
that comparisons of approximate (floating point) quantities can be performed only 
to a certain tolerance. Thus, the stopping condition that y. be strictly greater than 
R is, in practice, y. — R > 6; that is, we consider the hoop to be in contact with 
the supporting surface and, more importantly, we continue to impose constraints 1 
and 2, until the point of contact clears the surface by at least 6. Since the 
parameter 6 effectively defines when the hoop is considered to be airborne, the 
amount of time during which the hoop rolls while slipping prior to takeoff depends 
on 6, and so too does the height of the hop. Roughly speaking, this is because the 
longer the “rolling with slipping” phase prior to takeoff, the greater the takeoff 
speed that the hoop can build up—more precisely: the greater the amount by 
which |R6 sin 6| can exceed ly|. Here, |y| is the speed at which the center of mass 
falls under the influence of gravity, while |R@ sin 6| is the speed at which the 
center of the hoop rises as a result of the rotation of the system about its centroid. 
Numerical simulations confirm that the height of the hop decreases with 6, so one 
might expect in the limit 6 — 0 to observe no hop at all. However, the 6 — 0 limit 
is not only in conflict with physical reality, it violates even the assumptions of this 
admittedly idealized problem! The point is simply this: Even if we were able to 
perform our numerical computations to infinite precision, 6 could still not be 
made arbitrarily small. It is, after all, microscopic imperfections in the hoop and 
the supporting surface that gave rise to friction in the first place, and the scale of 
these imperfections provides a physical lower bound to 6. In the present case, we 
began with the assumption that hoop and surface are sufficiently “rough” that the 
former would initially roll without slipping over the latter. This precludes 6 — 0. 

The preceding discussion might lead one to doubt that the hop can actually be 
observed in practice. This is not the case: the hop is real, as the photograph in 
Figure 2 shows. 

Figure 3 depicts schematically the results of a typical numerical simulation using 
parameter values for a system that one could potentially realize experimentally: 
the hoop is not massless, and the attached object is not an idealized point mass. 
Shown are the trajectory of the center of mass (black line) and the position of the 
center of mass at the onset of slipping and at the instant the hoop loses contact 
with the supporting surface (open circles). Dots indicate the positions of the 
‘attached object (large dots) and the center of the hoop (small dots) at twenty 
equally spaced times. Spokes (light gray lines) connecting simultaneous positions of 
the object and the center of the hoop have been added as an aid in visualization. 
The figure was generated using A = 0.95, e = 0.01, and c = 0.1, the latter value 
corresponding to the attached object moving horizontally with an initial speed of 
2 meters/sec. The coefficient of static and kinetic friction between the hoop and 
supporting surface were taken to have values 1.0 and 0.8, respectively. The 
simulation generating the figure used 6 = R/100; in simulations employing smaller 
values of 6, the hop is less evident, but it is always present. 


Aug.-Sept., 1999] THE HOPPING HOOP REVISITED 615 


Figure 2. A stroboscopic photo by Dan Schwalbe and Stan Wagon shows a small hop of the hoop, 
which is a plastic hula hoop and four brass rods. 


Figure 3. Simulated hop of a real hoop: results of a numerical simulation using A = 0.95, e = 0.01, and 
c = 0.1 and coefficients of friction uw, = 1.0 and uw, = 0.8. The large dots indicate the positions of the 
attached object and the small dots indicate the corresponding positions of the center of the hoop. The 
trajectory of the center of mass of the system is indicated by the black line; the two open circles along 
the center of mass trajectory indicate the position of the center of mass at the onset of slipping and at 
the instant of loss of contact with the supporting surface. 


We note in passing that if the hoop is massless (A = 0) and the attached object 
is a point mass (e = 0), (8) can be integrated in closed form to give the following 
rather cute solution for 0(t): 

t D t 
6(t) = 2arcsin| ye sinh} — Sh 2 arcsin| ———sinh| — 
2) R 2ygR 2) R 

In his original essay [2], Littlewood gives his discussion of the hopping hoop a 
somewhat philosophical turn by posing a question to which we, in closing, now 
turn. Wonders Littlewood: Is the behavior of the hoop intuitive? We have 
demonstrated that there can be no hop without some slipping of the hoop. This 


§ 
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conclusion may at first appear somewhat surprising. We hope, however, that the 
reader now agrees that the phenomenon of hopping can be quite intuitive, at least 
if it is explained in general terms. In this instance, as in countless others, the devil 
is in the details. 


ACKNOWLEDGMENT. Thanks for this article are due to Stan Wagon, who encouraged the author to 
develop a computer animation of the hopping hoop, and who supplied the stroboscopic photograph in 
Figure 2. 
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MONTHLY Volumes 1-100 Now Available Online 


The entire content of volumes 1-100 (1894-1993) of the MONTHLY 
is now available online at www.jstor.org. Each year, one volume will 
be added to the JSTOR archive, so that all but the most recent five 
years will be available. Access to JSTOR at present is available only 
to faculty and students at participating institutions. The MAA is 
working to make access available to other individuals. 


High-resolution graphic images of pages can be viewed and printed, 
and rapid full text searching capability is provided. For example, 
readers interested in what the MONTHLY has had to say about 


Fenchel’s Theorem over the years will find 67 instances of “‘fenchel” 
in volumes 1-100; clicking on the links returned by the search 
produces a citation of the article in which the term appears, the 
first page of the article, or the first page containing the term in 
that article. 


The JSTOR archive now contains complete runs of 117 journals in 
15 disciplines, including 10 mathematics journals. JSTOR is a 
non-profit organization whose initial funding was provided by 
The Andrew W. Mellon Foundation; its ongoing support comes 
from participating institutions and libraries, who in turn provide 
online access to the JSTOR archive for their members, students, 
and faculty. 
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Why Dickson Left Quadratic Reciprocity 
Out of His History of the Theory of Numbers 


Della Dumbaugh Fenster 


In a 1993 letter to the Notices of the American Mathematical Society, Irving 
Kaplansky called attention to an astonishing omission in the history of mathemat- 
ics [27]. “Everybody knows,” Kaplansky asserted, “that Dickson’s History of the 
Theory of Numbers covers all of number theory up to about 1918. Right?” “Wrong,” 
he answered solidly, “[t]ry looking up quadratic reciprocity.” Kaplansky is right. 
Leonard Eugene Dickson’s monumental compendium of the history of number 
theory excludes the history of quadratic reciprocity, the “crown jewel of elemen- 
tary number theory” [27]. Why? Why did Dickson leave this celebrated number 
theoretic result out of his History? Kaplansky offers a brief explanation: “he 
farmed it out to a student” [27]. Again, Kaplansky is right, on some level at least. 

In this paper, we offer further insight into this perplexing omission. In the 
process, we reveal an entirely new perspective on Dickson and unfold yet another 
example in the history of mathematics where extra-mathematical factors contribute 
to the development of mathematics. The history of Dickson’s History actually 
begins in the last decade of the nineteenth century. 

While Dickson pursued a Ph.D. at the young Chicago from 1894 to 1896, the 
then group-theoretically minded E. H. Moore inspired him to write a thesis on 
(what we would call) permutation groups [15]. Although group theory would 
remain among Dickson’s research interests throughout his career, he would add 
finite field theory, invariant theory, the theory of algebras, and number theory to 
his repertoire of research interests. In the spring of 1900, just a few months past 
his twenty-sixth birthday, the Chicago Mathematics Department invited Dickson to 
join them as an assistant professor. From this position, Dickson made significant 
contributions to the consolidation and growth of the algebraic tradition in America 
[23]. Specifically, Dickson spent forty years (all but the first two) of his professional 
career on the faculty at Chicago where he directed 67 Ph.D. students, wrote 18 
books and roughly 300 manuscripts, served as editor of the American Mathematical 
Monthly and the Transactions of the American Mathematical Society, and guided the 
American Mathematical Society as its President from 1916 to 1918 [3]. 

Yet, this mathematical workhorse, who played billiards and bridge by day and 
did mathematics from 8:30 to 1:30 a.m. every night [1, 377], interrupted his thriving 
pure mathematical career for nearly a decade to write a three-volume, 1500 page 
historical account of the theory of numbers. As he explained it himself, he 
undertook this project because “it fitted with my conviction that every person 
should aim to perform at some time in his life some serious useful work for which 
it is highly improbable that there will be any reward whatever other than his 
satisfaction therefrom” [17, 2:xxi]. Although he viewed it as “highly improbable,” 
this altruistic mission paid handsome rewards for Dickson as this historical study 
ultimately led to his celebrated work in the arithmetics of algebras [23]. 

Dickson’s description of this historical undertaking as “serious useful work,” 
however, proved more than accurate. This was no hastily written history of number 
theory. On the contrary, Dickson had planned both the content of his project and 
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the precise method he would follow to present the details of his study. He revealed 
the scope of his plans when he explicitly stated his bold intention to “give an 
adequate account of the entire literature of the theory of numbers” [17, 1:iii]. As 
for his method, the following excerpt from Dickson’s History reveals both the 
thoroughness of his study and the historiographic view he maintained throughout 
this work. On the development of the theory-of perfect numbers, he included, for 
example, that 


Hrotsvitha, a nun in Saxony, in the second half of the tenth century, 
mentioned the perfect numbers 6, 28, 496, 8128. 

Abraham Ibn Ezra (1167), in his commentary to the Pentateuch, Ex. 3, 15, 
stated that there is only one perfect number between any two successive 
powers of 10. 

Rabbi Josef b. Jehuda Ankin, at the end of the twelfth century, recom- 
mended the study of perfect numbers in the program of education laid out in 
his book “Healing of Souls.” 

Jordanus Nemorarius (1236) stated (in Book VII, props. 55, 56) that every 
multiple of a perfect or abundant number is abundant, and every divisor of a 
perfect number is deficient. He attempted to prove (VII, 57) the erroneous 
statement that all abundant numbers are even. 

Leonardo Pisano, or Fibonacci, cited in his Liber Abbaci of 1202, revised 
about 1228, the perfect numbers 


22(27 — 1) = 6, $2°(2°— 1) = 28, $25(2 — 1) = 496 


excluding the exponent 4 since 2*— 1 is not prime. He stated that by 
proceeding so, you can find an infinitude of perfect numbers [17, 1: 5]. 


In 1500 pages, Dickson never swerved from this comprehensive, facts-only style of 
writing. This strict style, in the opinion of the number theorist D. N. Lehmer, made 
“the book...not so much a history as a list of references from which a history of 
the theory of numbers might be written” [28, 131-132]. 

To be sure, the reviews of this historical text indicate that Dickson made minor 
errors in his account. The operative word here is minor—he did not omit major 
contributions to number theory from his compendium—save the one under discus- 
sion. In fact, the reviews of this masterpiece suggest that Dickson accomplished his 
historical endeavor with the same prowess as his work in pure mathematics. As 
Robert Carmichael, a number theorist who read the proof sheets for the entire 
second and third volumes, expressed it in his review for this MONTHLY, 


To give an adequate account of the entire literature of so vast a subject 
and one of such long history as the theory of numbers is an undertaking of 
enormous magnitude; and it is carried through in this work with a marvelous 
success in the presence of which one must pause in admiration. Henceforth 
this history will be indispensable to all investigators in the theory of 
numbers... It is a piece of work for which one cannot find a parallel in the 
whole of scientific history [5, 397; 403]. 


Dickson’s History remains the classic reference on number theory up to 1918. It 
provided—and provides?—an “indispensable” source for those lacking adequate 
library facilities [5, 397]. In particular, as Dickson intended, the many “amateurs” 
interested in mathematics benefited from this (reputably) comprehensive, available 
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account of number theory [17, II: xx] and [5, 397].' As for the professional 
mathematician, Lehmer emphasized “the greatest need for just such a piece of 
work to promote efficiency among the professional workers in this field and to 
prevent them from wasting their time on problems that have already been 
adequately treated, and also to suggest other problems which still defy analysis” 
[28, 132]. Lehmer made this point in his review of volume I of Dickson’s History. 
The research mathematician would gain much more than “efficiency” by the time 
all three volumes appeared in print. 

Dickson’s “systematic” study of Diophantine Analysis for the second volume of 
his History, for example, provided him with a unique, sweeping perspective on this 
area of mathematics. From this vantage point, Dickson could assert that “[slince 
there already exist too many papers on Diophantine Analysis which give only 
special solutions, it is hoped that all devotees of this subject will in future refrain 
from publication until they obtain general theorems on the problem attacked if not 
a complete solution of it. Only in this way will the subject be able to retain its 
proper position by the side of other virile branches of mathematics” [17, 2: xx]. 
Dickson, in no uncertain terms, made this assertion with authority. Who better 
than a prominent research mathematician studying the “disjointed elements” of 
-Diophantine Analysis, could so confidently declare in essence that “[i]deas rather 
than computations are needed in this field”?* Dickson’s firm grasp on the past 
allowed him to see what would lead to a prosperous future for Diophantine 
Analysis.° Interestingly, he himself would devote the final fifteen years of his 
mathematical career focused on establishing a general result in Diophantine 
Analysis. But we have gotten ahead of our story. Simply put, Dickson made the 
history of number theory work in very utilitarian ways—tfar beyond serving solely 
as a reference volume—for the research mathematician. * 

Even still, Dickson’s purportedly complete history of the theory of numbers 
lacks the quintessential topic of elementary number theory, the law of quadratic 
reciprocity. This law relates the solvability of the congruences x* = p (mod q) and 
x* = q (mod p) for p and g distinct, odd primes. Specifically, if p or qg is of the 
form 4k + 1 (for k € Z), the two congruences are both solvable or both not 
solvable. If p and q are both of the form 4k +3 (for k € Z), one of the 
congruences is solvable and the other is not. In terms of the Legendre symbol, for 


‘In the letter to President R. S. Woodward of the Carnegie Institution, where he first put forth the 
idea of a History of Number Theory [13], Dickson described his “aim to make a volume indispensable 
to the specialists, but also a magnet to draw hold the attention of those non-specialists who desire to 
secure a connected scientific account of the subject.” This theme of appealing to professional and 
amateur alike appears throughout Dickson’s correspondence with the Carnegie Institution regarding his 
History. 

* Both quotations are from [6, 72-73]. This emphasis on general results could easily be viewed as a 
“Dicksonian trademark.” In his development of the definition of an algebra, Dickson sought the 
definition that yielded the theory with the widest applications. Similarly, in his work on the arithmetic 
of algebras, Dickson built his definition of an integral element, the crucial concept in the theory, using a 
“strategy of enlargement” as employed by Kummer and Gauss rather than defining an integral element 
on a case-by-case basis. See [22] and [23, 139-143; 152]. 

>In [7, 262], Carmichael emphasized the value of such forecasting when he wrote that “[w]hen a 
master, with the work of the past well in mind, tries to see the trend of the future, his judgement will be 
a matter of interest whether or not the direction of progress turns out to be such as he anticipates. It 
may even throw some light on the difficult question as to the way in which new discoveries arise.” 

*Dickson’s History inspired—in the most general sense of the word—one prominent twentieth- 
century mathematician. Richard Guy purchased Dickson’s History when he was about seventeen and 
found it “better than getting the whole works of Shakespeare and heaven knows what else” [2, 136]. 
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p and gq distinct, odd primes, 
()(2) - nl) 


This law, as Dickson described it himself, “is doubtless the most important tool in 
the theory of numbers and occupies the central position in its history. Its general- 
izations form a leading topic, past and present, in the theory of algebraic numbers” 
[18, 30]. Since the development of algebraic number theory grew, in large part, out 
of efforts to generalize quadratic reciprocity, it seems all the more unusual that a 
supposedly comprehensive History of the Theory of Numbers included no discussion 
of this area.° 

Why, then, did Dickson exclude an account of “this most important” tool from 
his History? The historical record suggests that Dickson did not intend for this 
omission to occur. In his closing remarks in the preface to volume II, Dickson 
refers to a Volume III as the “concluding” volume in the series [17, 2: xii]. Volume 
III appeared in 1923, “promptly” prepared, as Dickson described it in the preface, 
“owing to the favorable reception accorded to the first two volumes of this history” 
(17, 3: iii]. Early in the text of this third volume, nestled in his history of binary 
quadratic forms, Dickson points us forward to a fourth volume [17, 3: 3]. In this 
parenthetical remark, Dickson indicated his plan to include the quadratic reciproc- 
ity law in the fourth volume. But, of course, as we know now, the fourth volume 
never appeared. What happened to the fourth volume? 

The fourth volume involves Albert Everett Cooper, a University of Chicago 
graduate student from 1924-1926. Cooper had come to Chicago from the Univer- 
sity of Texas where he had earned three degrees and taught as an instructor in the 
mathematics department for 5 years [9, vita following dissertation text]. Arriving as 
he did in the fall of 1924, Cooper met a Dickson who had just spent over a decade 
collecting references on number theory, “digesting” them, and writing them up in 
a suitable historical account of the subject, which ultimately grew from one to two 
to three volumes and spanned some 1500 pages [14]. This same Dickson had 
recently managed to secure publication for volume III of his History despite a 
change in presidency at the Carnegie Institution. He had just published his work in 
the arithmetics of algebra. He had now begun to focus his research interests on 
number theoretic topics, increasingly inclined toward problems related to exten- 
sions of Waring’s Theorem. Perhaps more to the point, Cooper met a Dickson who 
had promised the Carnegie Institution, the mathematical world, and himself a 
comprehensive account of number theory, and, thus far, had failed to include 
information on the history of quadratic residues or reciprocity laws. 

Cooper earned his Ph.D. in mathematics in the spring of 1926 under Dickson’s 
guidance, with his historical dissertation “A Topical History of the Theory of 
Quadratic Residues” [9]. The title of Cooper’s dissertation alone suggests a 
connection with Dickson’s larger historical undertaking. Indeed, Cooper wrote this 


>In [16], Dickson claimed that “the study of this challenge problem [Fermat’s Last Theorem] and the 
general law of reciprocity of higher residues led [Ernst Eduard] Kummer to invent his ideal numbers, 
out of which grew the general theory of algebraic numbers, one of the most important branches of 
modern mathematics [p. 161].” This article seems to represent an expanded version of a similar 
discussion in his History, 2, pp. xviii-xix, 739-740. In [21, 324], Harold M. Edwards advances the view 
that the study of higher reciprocity laws and not Fermat’s Last Theorem led Kummer to his study of 
what we know as cyclotomic integers and, ultimately, his ideal numbers. Edwards developed this idea 
more thoroughly in [20, 79-81]. 
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dissertation with the intention that it appear as a chapter in the fourth volume of 
Dickson’s history. Cooper cited earlier volumes of “this History,” as he referred to 
it, more than twenty times in his 98-page thesis. In Cooper’s section on “Number 
and Distribution of Residues,” for example, Cooper asserted that ‘“Dirichlet’s 
fundamental formula for the distribution, in half, quarter, and eighth intervals, of 
the quadratic residues of a positive odd integer P stated in terms of the class 
number of a binary quadratic form of negative determinant are quoted in vol. IIT, 
page 101 of this History” (9, 53-53A, our emphasis]. 

More importantly, at least relative to the study at hand, Cooper’s dissertation 
did not include a history of the law of quadratic reciprocity. Cooper did not 
overlook this “principal” theorem, however. Cooper titled his dissertation appro- 
priately; he wrote a historical account of the theory of quadratic residues. On at 
least sixteen occasions in his dissertation, in fact, Cooper referred to a specific 
chapter on quadratic reciprocity in the purported fourth volume. “For proof 


[of G. Zolotareff’s “unique method” of evaluation of (*) where k, p € Z, 


p a prime, and p does not divide k],” Cooper wrote in his dissertation for 
example, “see chapter on quadratic reciprocity, this History, vol. IV” [9, 35]. He 
_ended the text of his dissertation pointing to the “chapter on the law of quadratic 
reciprocity” and leaving spaces to assign both this chapter a number and the 
results of eight mathematicians page numbers within this chapter. Thus, Cooper 
wrote his dissertation to form a chapter on quadratic residues in Dickson’s fourth 
volume, perhaps in the same spirit as G. H. Cresse’s chapter on the class number 
in volume III.° Dickson’s fourth volume, as Cooper understood it, would contain a 
separate chapter on the history of quadratic reciprocity. Dickson must have had 
this same understanding; he, after all, approved Cooper’s thesis. 

Dickson did more than simply approve Cooper’s thesis at the end; he also gave 
him a boost at the beginning. Cooper described Dickson as an advisor who “not 
only furnished the entire body of original references from which the topical history 
was written, but also took a great deal of personal interest in the preparation of 
the material. My appreciation is particularly due Professor Dickson’ [9, vita, 
immediately following dissertation text]. Although Cooper probably intended that 
this comment serve as an acknowledgement to his thesis advisor, this remark 
indicates that Dickson had amassed a collection of references on the history of 
quadratic residues by the time Cooper arrived at Chicago in 1924. H. S. Vandiver 
further substantiates this claim in a letter he wrote to Cooper on 5 November, 
1925. “I’m sending you under separate cover,’ Vandiver wrote in response to 
Cooper’s request for copies of his articles relating to quadratic residues, “copies of 
my articles which touch on quad-residues ... . Perhaps you know that Dickson col- 
lected references of quad. residues while writing his History” [32]. Since Dickson 
collected quadratic residue references, surely he must have intended to include 
them, in some way at least, in his History. Although it may not have been his initial 
plan when he conceived of his History in 1911, Dickson ultimately entrusted the 
history of quadratic residues to a graduate student, namely, Albert Cooper. 


George Hoffman Cresse earned a Chicago Ph.D. in 1918 under Dickson’s guidance with the 
historical dissertation “On the Class Numbers of Binary Quadratic Forms” [30]. A revised form of this 
dissertation appeared as chapter VI in the third volume of Dickson’s History. The official title page of 
the third volume of Dickson’s History reads, ‘“‘History of the Theory of Numbers, Volume III, Quadratic 
and Higher Forms, By Leonard Eugene Dickson, Professor of Mathematics at the University of 
Chicago, With a Chapter on the Class ‘Number, by G. H. Cresse.” 
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Perhaps the more compelling question raised by this evidence surrounds the 
publication of Cooper’s work. What happened to Cooper’s thesis work on quadratic 
residues? And where is the chapter on quadratic reciprocity? 

After Cooper completed his Ph.D. at Chicago, he rejoined the University of 
Texas mathematics faculty. A year later, in 1927, Dr. Harry Y. Benedict, a former 
classmate from Dickson’s undergraduate days at Texas, assumed the presidency of 
that institution. Dickson gave his view on the election of Benedict to the presi- 
dency of the University of Texas in the alumni publication, The Alcalde. As 
Dickson saw it, “[t]he election of Dr. Benedict as President of the University of 
Texas is particularly fortunate. All are familiar with his success as dean, due to his 
unerring judgement, rare talents as an executive, and deep affection for the 
University. But I wish to emphasize the fact that the man having all these essential 
qualities is also a scientist [astronomer]. This is the age of science...” [4] (and 
partly quoted in [29, 233]). Within months of penning this favorable response to 
Benedict’s election, Dickson personally appealed to Benedict to finance a mathe- 
matical publication. That publication? The fourth and final volume of the History 
of the Theory of Numbers. 

The letter to Benedict came on the heels of a two-month-long exchange of 
correspondence between Dickson and the Carnegie Institution regarding the 
publication of the fourth volume of Dickson’s History. On 1 December, 1927, 
Dickson wrote the Carnegie Institution regarding the recently expressed interest of 
G. E. Stechert & Co. to reprint volumes I and II of Dickson’s History. Dickson 
advised ‘allowing Stechert to reprint” and explained that “[w]e think that even if 
Carnegie Inst[itutio]n could afford to reprint, it would be wiser for it to spend 
same sum to print (short) vol IV of the History and so complete the series and to 
spend the balance on a History of the Solution of Equations” [11]. The administra- 
tive secretary of the Carnegie Institution, W. M. Gilbert, replied that the Institu- 
tion would both prefer Stechert to reprint the two volumes and “be glad to have 
an opportunity to issue your fourth and final volume...” [25]. 

Dickson, however, had more on his mind than reprinting his History and 
publishing his fourth volume. Apparently, since the first of December he had 
outlined a new plan in a series of letters to Gilbert and John C. Merriam, the 
president of the Carnegie Institution. “My suggestion is as follows,” Dickson wrote 
to Gilbert and Merriam at the end of that December, 


[I]et the Institution abandon not only the reprinting of Vols I & II, but also 
the printing of Vol. IV. Instead, let the Institution take on half the burden of 
publishing an entirely new work on the Theory of Numbers (which will meet 
all legitimate needs of a History, but also attend to the more important needs 
of presenting the whole theory of numbers as a science, with emphasis on 
methods). As I wrote before, the University of Chicago Press is committed to 
publishing one of the two necessary volumes... . 

The new work will be incomparably superior to the old History; will be 
what is needed permanently in this field; and will be a fitting sequel to my 
35 years research in this field [12]. 


Thus Dickson himself proposed that the Carnegie Institution no longer plan to 
publish the final, fourth volume of his History, but rather, agree to publish one of 
two new forthcoming volumes on Number Theory. Dickson felt that “an abbrevi- 
ated Vol. IV might be published by some agency other than the Institution 
(& I would so undertake)” [12]. Apparently, over the course of the next month, 
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Dickson determined that his old classmate Benedict and the University of Texas 
might just prove the best possible path of publication for volume IV. 

“My dear Benedict,” Dickson began in his letter of 27 January, 1928, 
“Dr. Cooper has now practically complete his historical ms on ‘Quadratic Residues 
and Reciprocity Law.’ He has done a fine job” [10]. This opening sentence 
indicates that between the spring of 1926, when Cooper earned his Ph.D. from 
Chicago, and January of 1928, Cooper had written “the chapter” on the history of 
quadratic reciprocity. Dickson explained to Benedict that he had originally planned 
to include Cooper’s historical manuscript on quadratic residues and reciprocity law 
in the final, fourth volume of his History. If he carried out this plan, however, he 
would have to include a supplement containing corrections and additions to the 
first three volumes. (In [17, 3: iv], Dickson asked readers to send “errata or 
omissions, which will be published later as a supplement.”) “There are,” as 
Dickson tersely explained it, “reasons against my undertaking [a supplement].” 
“Also,” Dickson added, now shifting the focus from himself to Cooper, “the 
objection that such a vol. IV would not be wholly the work of Cooper” [10]. 
Although Dickson did not cite the source of this “objection,” he seemed to imply 
that Cooper (and, presumably, the University of Texas by association?) would not 
-receive the credit he deserved if his work appeared as chapters in Dickson’s fourth 
volume. It would be better for Cooper, Dickson emphasized, if he published his 
manuscript separately, perhaps, as Dickson suggested, with a subtitle to indicate 
that it formed the fourth volume of his History. 

Dickson told Benedict that the “problem,” as he referred to it, had further 
complications. Specifically, by this time, the first two volumes were out of print. 
Dickson had planned to “replenish” the information in volumes I & II by writing a 
2-volume historical and expository account of the Theory of Numbers. As Dickson 
presented it to Benedict, the University of Chicago agreed to print one of these 
volumes and the Carnegie Institution agreed to print the other. There was a 
condition, however. The Carnegie Institution would print one of these two new 
historical and expository accounts of number theory only if relieved of any 
responsibility for printing a fourth volume in the original historical series. It seems 
Dickson gently twisted the details to attempt to secure publication for both his two 
new works on Number Theory and “his” volume IV. 

Having spelled out all of these details, Dickson proposed “the following best 
plan to serve the interests of Mr. Cooper and mathematics [and himself?]: Let the 
University of Texas publish a book by Dr. Cooper on the History of Quadr[atic] 
Residues and Reciprocity Law. This would close up the gap now existing from lack 
of vol. IV of my History and would take the place of the latter” [10]. Although 
Dickson promised sales for a book by Cooper and urged Benedict to follow the 
lead of other large universities who aided in the publication of serious work done 
by their faculties, we know Benedict did not agree to publish this work of Cooper’s 
as the subtitled fourth volume in the series. We know Benedict did not agree to 
publish the fourth volume, but we do not know why. His reply to Dickson remains 
lost.’ Benedict replied to hundreds of requests, including those of garden clubs 


"Here, by lost we mean that the copy of the letter Benedict sent Dickson is not in any of the 
“natural” or even some of the “unnatural” collections in the Texas Archives. Of course, Benedict could 
have responded to Dickson by phone. As Albert Lewis points out in [29, 207], in the 1920’s university 
presidents began responding to more sensitive and controversial issues by telephone. In these cases, the 
president usually noted “Answered by telephone” at the bottom of the original letter. Dickson’s letter 
bears no such annotation. 
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and small girls schools in south Texas; he must have replied to this letter. Since 
Dickson burned his papers upon his retirement, he probably destroyed the original 
copy of Benedict’s response. 

After January of 1928, Dickson apparently never mentioned volume IV in his 
correspondence with the Carnegie Institution again. Cooper, however, sent a 
telegram to the Institution in June of 1929. announcing that “I have ready the 
manuscript for the fourth volume.” He queried further, ‘““How many copies do you 
think should be printed?” [8] The secretary of the Division of Publications of the 
Carnegie Institution sent Cooper his advice but added that “[iJt seems to me that, 
with the information given here, Dr. Dickson would be the best one to decide this 
matter’ [26]. Although Cooper and Dickson exchanged scores of papers, notes, 
and communiques of various forms on the history of quadratic reciprocity, they did 
not seem to leave any traces of a discussion of the publication of this material [31]. 

The organized and polished pieces of this collection, however, appear to 
represent the page proofs of a book written in the same spirit and style of the first 
three volumes of Dickson’s History. (Perhaps these form the fourth volume Cooper 
referred to in his telegram?) The more loose and ordinary fragments (written on 
“scrap” paper, scrawled on the back of notices from the registrar, etc.) seem to 
provide brief summaries of and references to various articles on quadratic residues 
and reciprocity. The collection contains notes and papers written by both Cooper 
and Dickson. Cooper, however, maintained the collection. It seems unlikely that 
Dickson would have continued to supply Cooper with the information if he 
intended to write this portion of the history himself. Moreover, as Dickson 
indicated in his letter to President Benedict, Dickson could not have written the 
main section on the history of quadratic residues because Cooper had already 
done it. So either Dickson intended to publish the history as a collaborative effort 
with Cooper or, more likely, he saw Cooper as publishing it himself. 

The question now becomes “why did Dickson not forge ahead with the fourth 
volume in some form?” It was totally uncharacteristic of Dickson to leave his work 
incomplete, as it were. He had a plan, at least in late 1927 and early 1928, to see 
this material to press. Cooper still persisted with ideas of publication as late as 
June of 1929.° But the work never appeared. Perhaps, by the late 1920’s, with his 
historical project more than fifteen years old and his research program devoted 
almost exclusively to Waring’s Problem, Dickson found himself completely occu- 
pied with other mathematical endeavors. Maybe his interest waned in the histori- 
cal text, maybe he mentally turned over the fourth volume and its publication to 
Cooper, or, maybe, his “astonishing supply of energy” finally evaporated.’ 

Whatever the case, our historical study leads to an intriguing observation. 
Leonard Dickson sits squarely in the center of this episode in the history of 
(American) mathematics. Yes, Leonard Dickson, the prolific mathematician who 
had a reputation for completeness, for high standards, for excellence—even to the 
point of being impolite when insisting upon these standards [24, 13-14]. And, yet, 
this same Dickson had to come to terms with the non-appearance of this signifi- 


“Aside from the aforementioned telegram from Cooper to the Carnegie Institution in June of 1929, 
we have no other record of Cooper’s attempt(s?) to publish this material. 

Interestingly, in [7, 259], Carmichael subtly hinted at the possibility of Dickson running out of 
steam when he wrote that “[t]he reviewer ventures to predict that the favorable reception of the third 
volume will give the author still more reason for proceeding promptly with the fourth if his astonishing 
supply of energy is holding out well enough to leave him still susceptible to such influence.” 
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cant result in number theory. This set of events in the history of mathematics 
certainly sheds new light on Dickson the mathematician and the man. 

In the end, then, once again, the history of mathematics teaches us that 
mathematics—and mathematicians, for that matter—are more than they appear. 
In particular, mathematical and extra-mathematical factors impinge upon the 
development and publication of mathematics, and the history thereof. In this case, 
clearly, the extra-mathematical factors, in the form of authorship priority, publish- 
ing contracts, and finances, outdistanced the mathematical factors and resulted in 
a highly unusual omission. 
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Uniform Calculus and the Law 
of Bounded Change 


Mark Bridger and Gabriel Stolzenberg 


1. INTRODUCTION. In a recent exchange about the role of the mean value 
theorem in the theory of the calculus, T. Tucker notes that “the origin of the 
Mean Value Theorem in the structure of the real numbers” is much too diffi- 
cult for a standard course [6]. He shows how the increasing function theorem 
(a function with positive derivative is increasing) serves very nicely in place of the 
mean value theorem, and sketches a proof of it from the nested interval property 
of the real number system. 

In support of the mean value theorem, H. Swann recalls its derivation from the 
extreme value theorem (a continuous function on a closed interval has a maximum 
~ value) via Rolle’s theorem and remarks that “such a sequence of arguments reveals 
the charm and power of mathematics, for we prove that a questionable compli- 
cated result must be true if we assume other simpler results that are less 
questionable” [5]. 

We agree with Swann about the charm and power of mathematics and with 
Tucker about the ability of the increasing function theorem to play a role 
traditionally accorded the mean value theorem. In fact, we give several examples 
that support Tucker’s claim. But Tucker and Swann work with pointwise continuity 
and differentiability, weak notions that make proving statements like the increas- 
ing function theorem more difficult. On closed finite intervals, uniform continuity 
and differentiability are as easy to verify, and using them as starting points permits 
a natural development of the calculus in which such difficulties do not arise. 

Our treatment of continuity and differentiation is from our forthcoming book, 
A New Course of Analysis, where it is expressed in terms of a theory of real 
numbers based on interval order and arithmetic. We offer no theory of real 
numbers in this article but we use repeatedly the fact that each real number can be 
approximated by rationals to arbitrary accuracy. 


2. UNIFORM CALCULUS. Continuity. Uniform continuity of a one variable 
function f is a condition on its variation, f(y) — f(x). The condition, written 
d(x, y) > 0 as y—x-— 0 for any two-variable function ¢, is that for each 
e > 0, there is a 6 > 0 such that |d(x, y)| < e€ if ly —x| < 6. When (x, y) = 
U(x, y) — u(x), we also write: U(x, y) > u(x) as y > x. 


Example. The relationship 
i . . . 
iy" —x" = | bye], — x) 
i241 


shows that |y” —x"| <nC"~!|y — x] on any interval of the form [—C, C] and, 
hence, that x” is uniformly continuous on each finite interval. A proof of pointwise 
continuity could hardly be simpler. 
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Example. Using p” + q” <(p+q)" with p =x'/" and q = (y — x)'/", we have 
O<y/"*-x¥" <(y—-x)'"<e if0<x<y andy—x<e’. 


It follows that for each positive integer n, x'/" is uniformly continuous on [0, ~). 


Proposition 2.1. A composition of uniformly continuous functions is uniformly con- 
tinuous. : 


Proposition 2.2. A uniformly continuous function f on a finite interval I is bounded. 


Proof: For € > 0, let 6 > 0 be given by uniform continuity. Because J is finite, we 
can find finitely many points such that every p € J is within 6 of at least one of 
them. Hence, f is bounded by e plus the maximum of its values at these finitely 
many points. = 


Differentiability. Uniform differentiability of a function f also is a condition on 
its variation: it factors as f(y) — f(x) = F(x, yy — x), where F(x, y) > F(x, x) 
as y > x. If f is uniformly differentiable, its derivative is the function f’(x) = 
F(x, x). Thus, F(x, y) = (f(y) -— f(x))/(y — x), for y different from x, and 
F(x, x) = f'(x). 

Because the difference quotient converges to the derivative as y > x, the 
derivative is unique on any domain S for which each x in S is approximable to 
arbitrary accuracy by points y in S different from x. 


Example. For all positive integers n, using the factorization of y” — x” and the 
arithmetic of convergence (see Lemma 4.1), it follows that on each finite interval, 


x" is uniformly differentiable with derivative nx"~'. 


Example. Because y” — x* =(y + xy —x) and y+x > 2x on Ras y > x, x? 
is uniformly differentiable on R with derivative 2x. 


Proposition 2.3. If f is uniformly differentiable, then f’ is uniformly continuous. 


Proof: Because F is symmetric, if x and y are close enough, both f’(x) and f'(y) 
are within e of F(x, y) = F(y, x) and hence within 2€ of each other. a 


Corollary 2.4. On finite intervals, f' is bounded. 
Proof: Propositions 2.2 and 2.3. = 
Proposition 2.5. If f' is bounded, then f is uniformly continuous. 


Proof: When f"’ is bounded, so is F(x, y), say by C, for:|y — x| sufficiently small. 
Hence, |f(y) — f(x)| < Cly —x| ~ 0as y-—x- 0. Pe 


Theorem 2.6. (Fundamental Theorem of the Calculus) If g is uniformly continuous 
on [a, b], then G(x) = {7g(t) dt is uniformly differentiable on | a, b| with G' = g. 


Proof: G(y) — G(x) equals the integral of g from x to y, which equals y — x 


times a limit of averages of values of g at points in [x, y]. (To see this, approximate 
the integral by Riemann sums with equal spacing.) Also, for each e > 0, if |y — x| 
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is small enough, every value of g at a point in [x, y] is within e of g(x). But then, 
also, any limit of averages of values of g at points in [x, y] is within € of g(x), so 
we are done. 


3. THE ARITHMETIC OF UNIFORM CONTINUITY. The arithmetic of uniform 
continuity is very simple. If both f and g are uniformly continuous, so is f + g. If 
also f and g are bounded, then fg is uniformly continuous. Finally, if 1/f is 
defined and bounded, it too is uniformly continuous. 

These statements can be verified by first relating the variations of the sum, 
product and reciprocal to those f and g. Simple algebra shows that var(f + g) = 
var(f) + var(g), var(fg) = g(y)var(f) + f(x)var(g) and var /f) = 
—var( f)/( f(x) f(y). 

Because each expression is a sum of expressions of the form B(x, y)(x, y), 
where B(x, y) is bounded and (x, y) > 0 as y — x — 0, it suffices to verify that 
each such sum again converges to 0 as y —x — 0. We omit the simple proof of 
this. 


4. THE ARITHMETIC OF UNIFORM DIFFERENTIABILITY. If f and g are 
. uniformly differentiable, derivatives for their arithmetic combinations are given by 
the following rules. 


Sums. f+ g is uniformly differentiable with (f+ g) = f’ +g’. 


Products. If f, g, and their derivatives are bounded, e.g., if their domain is a 
finite interval, then fg is uniformly differentiable with (fg) = f'g + fe’. 


Reciprocals. If 1/f is defined and bounded, and f/f’ also is bounded, then 1/f is 
uniformly differentiable with (1/f) = —f'/f’. 


To prove these assertions, we begin by substituting F(x, y)(y —x) and 
G(x, yy — x) for var(f) and var(g) in our expressions for var(f + g), var(fg), and 
var(1/f). For the sum, we get F(x, y) + G(x, y), for the product, g(y)F(x, y) + 
f(x)G(x, y), and for the reciprocal, —F(x, y)/(f(x)f(y)), each multiplied by 
y-x. 

For y = x, these expressions become f’(x) + g'(x), g(x)f'(x) + f(x)g'(x), and 
=f (0) /f- OG): 

The case of the sum is clear. Both F(x, y) — f’(x) and G(x, y) — g’(x) converge 
to 0 as y—x-— 0, hence so does the sum. For the product and reciprocal, 
multiplications are involved. The following lemma gives us what we need to deal 
with them. 


Lemma 4.1. Suppose that u and v are bounded. If U(x, y) > u(x) and V(x, y) > 
v(x) as y > x, then for y —x sufficiently small, U and V are bounded and 
U(x, yV x, y) > u(x)v(x) as y > x. 


Proof: Write UV — uv = uV — v) + (U — u)V and note that, because uv and V 


are bounded for y — x small enough, each summand converges to 0 as y —x — 0. 
a 


The next lemma is used to prove Proposition 6.1 about the differentiability of an 
inverse function. 
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Lemma 4.2. Suppose that 1/u is defined and bounded. If U(x, y) > u(x), then for 
y — x sufficiently small, 1/U is defined and bounded, and 1/U(x, y) > 1/u(x) as 
yx. 


Proof: We prove only the second part. Write 1/u — 1/U = (U — u)/uU and note 
that 1/uU is bounded for y — x sufficiently small. = 


For the product rule, we reason as follows. By assumption, f is bounded and 
G(x, y) > g(x) as y>x. Thus, f(x)G(x, y) > f(x)g'(x) as y > x. Because 
limits add, it suffices to prove that g(y)F(x, y) > g(x)f'(x) as y > x. But, also by 
assumption, F(x, y) > f’(x) as y > x, and f’ and g are bounded. Hence, if 
g(y) > g(x) as y > x, we can apply Lemma 4.1. It therefore suffices to note that 
g is uniformly continuous because g’ is bounded. 

Similarly, for the reciprocal rule, f is uniformly continuous because f’ is 
bounded, and because 1/f is bounded, it too is uniformly continuous. Hence, 
1/fly) ~ 1/f(x) as y > x. Multiplying by 1/f(x), we see that 1/f(x)fly) > 
1/f?(x) as y > x. Because —F(x, y) > —f’(x) as y > x, and the limit functions 
1/f* and —f’ are bounded, the product converges to the product by Lemma 4.1. 


5. THE CHAIN RULE 


Proposition 5.1. If f and g are uniformly differentiable, and if f' and g’ are bounded, 
then f(g) is uniformly differentiable with derivative f'(g)g'. 


Proof: Because f(g(y)) — f(g(x)) = F(g(x),g(y Me(y) - g(x)), which in turn 
equals F(g(x), g(y))G(x, yy — x), the candidate for the derivative of f(g) is 
indeed f’(g)g’. Because g’ is bounded, g is uniformly continuous. Hence, 
F(2¢(x),2(y)) > f'(g(x)) as y > x. Because G(x, y) > g(x) as y > x and f(g) is 
bounded, an application of Lemma 4.1 gives us the desired result. = 


6. DIFFERENTIABILITY OF THE INVERSE 


Proposition 6.1. If h is a uniformly continuous inverse for f, and if 1/f’ is defined 
and bounded, then h is uniformly differentiable with h' = 1/f'(h). 


Proof: Because h is an inverse for f, we can factor the variation of the identity 
function as 


y—x=f(h(y)) — f(ACx)) = FCACx), ACY) ) (ACY) — AC*)). 
This shows that 1/F (h(x), h(y)) is equal to the difference quotient for h when 
ly — x| > 0. Because h is uniformly continuous, F(h(x), h(y)) > f’(A(x)) as y — x 
— 0. Therefore, because 1/f’(h(x)) is defined and bounded, we can apply Lemma 
4.2 to conclude that 1/F(hA(x), h(y)) > 1/f’(h(x)) as y > x. he 


7. THE LAW OF BOUNDED CHANGE 


Theorem 7.1. [ff is uniformly differentiable and A < f' < B on[a, b], then A(b — a) 
< f(b) — fla) < Bib — a). 


This is the law of bounded change. It says that bounds for the derivative are 
bounds for the difference quotient. Notice that the increasing function theorem is 
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just the law of bounded change for A = 0 (and we don’t care about B) and the law 
of bounded change is the increasing function theorem applied to the functions 
Bx — f(x) and f(x) — Ax. 


Proof: It suffices to prove that for all e > 0, the conclusion holds with A and B 
replaced by A — € and B + e. The justification for this is the general truth that if 
p<q+e forall e> 0, then p <q. That this holds for reals follows by rational 
approximation from the fact that it holds for rationals. 

Since F(u,v) > f’(u) as v > u, for each € > 0 there is a 6 > 0 such that 
f'(u) — € < Flu,v) < f’(u) + € for0 < v—u < 6. But A < f'(u) < B, so f(v) - 
flu) = Flu, vv — u) lies between (A — €)(v — uw) and (B + eu — u). 

Hence, if we express f(b) — f(a) as a telescoping sum of n differences f(u,) — 
f(u;_,), where uy =a and each u,; —u;_, =(b-—a)/n < 6, we have that 
(A — eb — a) < f(b) — f(a) < (B+ eb — a). a 


We now draw several useful and easy consequences of the law of bounded 
change. 


Corollary 7.2. f is constant on any interval on which f' = 0. 
Proof: This is just the law of bounded change with A and B equal to 0. i 


Is there any simpler or essentially different way to prove this deceptively obvious- 
looking fact? 


Corollary 7.3. f(x) — f(a) = [* f(t) dt. 


Proof: By the fundamental theorem of the calculus, the two sides of the equation 
have the same derivative. Hence, by Corollary 7.2, they differ by a constant. But 
they agree at x = a, so they agree everywhere. Ds 


Alternatively, we can observe that in the proof of the law of bounded change, we in 
effect approximate f(x) — f(a) to arbitrary accuracy by Riemann sums for the 
integral of f’ from a to x. Because these sums also approximate the integral, the 
two must be equal. 


Corollary 7.4. If f’ >A > 0 on[a, b] and f(h(u)) = u for all u in [ f(a), f(b)], then 
h is uniformly continuous. 


Proof: By the law of bounded change, if h(u) < h(v), then A(h(v) — h(u)) < 
f(h(v)) — fh) = v — u. S00 < hv) — hu) < (vu -—u)/A ~ Oasv-u-0. & 


By the inverse function theorem, whenever f’ > A> 0 on [a,b], there is a 
function /: as in the statement of Corollary 7.4. 


Corollary 7.5. If A < f’ < Bon [a,b], then 


fly) — f(*) 


—f'(u)| <B-A_ forallx <yandalluin {a,b}. 
yx . 
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Proof: We apply the law of bounded change on [x, y]. Because the values of f’ are 
in [A,B], so is the difference quotient (f(y) — f(x))/(y — x), which therefore 
cannot differ from any value of f’ by more than B — A. = 


Corollary 7.6. If f is uniformly differentiable on all sufficiently small subintervals of an 
interval I and if f' is uniformly continuous ont, then f is uniformly differentiable on I. 


Proof: For € > 0, the values of f’ lie between f’(x) — € and f’(x) + € on each 
sufficiently small [x, y] in J. Therefore, if f(y) — f(x) = F(x, y)\(y — x), Corollary 
7.5 shows that |F(x, y) — f’(x)| < 2e for |y — x| sufficiently small. a 


The next consequence of the law of bounded change is needed for L’H6pital’s 
Rule. In it, A and B are constants, and f and g are uniformly differentiable on 
La, bl]. 


Corollary 7.7. (Generalized Law of Bounded Change) If Ag’ < f' < Bg’ on [a, b], 
then A[g(b) — g(a)] < f(b) — fla) < Blg(b) — g(a). 


Proof: Apply the increasing function theorem to Bg —f and f — Ag, and rear- 
range the resulting inequalities. = 


8. APPLICATION: L’HOPITAL’S RULE. We now present a few examples in 
support of Tucker’s contention that the increasing function theorem serves nicely 
to prove major theorems of the calculus that traditionally are derived from the 
mean value theorem [6]. We begin with L’H6pital’s Rule; see also [2]. There are 
two cases. In both, we assume that f and g are defined on a semi-infinite interval 
[c,) and are uniformly differentiable on each finite subinterval. We assume also 
that g and g’ are positive. 


Proposition 8.1. Jf f(x) and g(x) > 0 and f'(x)/g'(x) > L as x > &, then also 
f(x) /g(x) ~ Lasx > ~&, 


Proof: For «> 0, L —¢€<f'/g'’<L +e on[p,©) if p is large enough. In that 
case, if p <x < y, the generalized law of bounded change ensures that 


(L — €)(g(x) —g(y)) s f(x) -—fly) s (£2 + ©)(8(*) - 8()). 


Because weak inequalities are preserved in the limit, if we let y ~ ~ and divide by 
g(x) > 0, we obtain L — € < f(x) /g(x) < L + € for all x = p. z 


In the second case of L’Hopital’s Rule, it is common to assume also that 
f(x) > ©, but there is no need to do so. 


Proposition 8.2. [f g(x) — © and f'(x)/g'(x) > L as x > &, then also f(x)/g(x) 
> Lasx > ~, 


Here too, the generalized law of bounded change is used only once. We note 
that if f’(x)/g'(x) lies between L — €/2 and L + €/2 for x => p, then so does 
(f(x) — f(p))/(g(x) — g(p)). But to complete the argument, one has to be more 
artful than in the first case.° 
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9. APPLICATION: DIFFERENTIATION UNDER THE INTEGRAL SIGN 


Definition 9.1. A two-variable function f is uniformly continuous if for each e > 0, 
there is a 6 >0 such that |f(x’, y’) — f(x, y)| < € whenever both |x’ — x| and 
ly’ — y| are smaller than 6. 


Remark 9.2. If the second coordinates in Definition 9.1 are equal, then the 
condition that guarantees that |f(x’, y) — f(x, y)| < e involves only the first coor- 
dinates: |x’ — x| < 6. That is, for each e€ > 0, one 6 > O works for all horizontal 
lines y = constant. This simple observation is a key to our proof of Theorem 9.3. 


Theorem 9.3. (Differentiation Under the Integral Sign) Let f be defined on Q = 
[a, b] X [c,d] and uniformly continuous on {a} X [c,d]. If f is uniformly differen- 
tiable on each [a,b] X {y} and its partial derivative f., is uniformly continuous on Q, 
then f is uniformly continuous on Q and the integral of f., over [c, d] is a derivative for 
the integral of f over [c, d]. 


Proof: We assume the uniform continuity of f on Q. It can be be proved fairly 
easily using Corollary 7.3 but we prefer to focus here on the second part of 
.the argument, which employs a less familiar application of the law of bounded 
change. Integrating f(y, t) — f(x, t) = F(x, y, t)\(y — x) over [c, d], we see that to 
complete the proof, it suffices to demonstrate that the integral of F(x, y,t) — 
f(x, t) over [c, d] converges to 0 as y — x — 0. To this end, it suffices to show that 
|F (x, y, t) — f(x, t)| can be made less than any e€ > 0 by making |y — x| less than 
some 6 > 0, independent of ¢. 

It follows from Corollary 7.5 that |F(x, y,t) — f,(x, t)| is bounded for each t 
by any bound for |f,(v, t) — f,(u, t)|, for all u and v in [x, y]. By Remark 9.2, for 
any € > 0, there is a 6 > 0 such that if |x —y| < 6, then, for all ¢ in [c,d], 
If (v, ) — fu, t)| < e for all u and v in[x, y]. This is precisely what we need. 


Proposition 9.3 also follows easily from reversal of order of integration and 
Corollary 7.3. Because reversal of order of integration is a simple consequence of 
the existence of the double integral of a uniformly continuous function, this 
provides a proof of Proposition 9.3 that uses the law of bounded change in a more 
familiar way. 


10. HIGHER DIMENSIONS. In higher dimensions, there is no obvious counter- 
part to the increasing function theorem, and the mean value theorem is false even 
for a mapping from an interval to R*. Yet the law of bounded change generalizes 
almost without alteration if we regard convex sets as higher dimensional counter- 
parts to intervals and read the proof as showing that if f 1s defined on an interval 
[a, b], then any closed interval that contains {f’(u)(b — a): u € [a, b]} also con- 


tains f(b) — f(a). 


Definition 10.1. A map f from a subset U of one normed linear space X to 
another Y is uniformly differentiable if there is a map Df from U to the set of 
bounded linear transformations from X to Y such that for each e > 0, || f(g) — 
f(p) — Df(p\q - Dily < ella — pllx if llg — pllx is sufficiently small. 


Proposition 10.2. Using the notation in Definition 10.1, if U is convex then, for each 
p and q in U, f(q) —f(p) belongs to every convex subset of Y that contains 
Df(uXq — p) for all u in U. Hence, if each Df(u): X — Y is bounded by K on the 
unit sphere of X, then ||f(q) — f(plly < Kllq — pllx. 


634 UNIFORM CALCULUS [Monthly 106 


Proof: If p and gq are in U, so is the line segment joining them. Hence, using a 
telescoping sum as in the proof of Theorem 7.1 and approximating each summand 
by the value of Df at a point on the segment, applied to (g — p)/n, we can 
approximate f(q) — f(p) to arbitrary accuracy by an average of finitely many 
values of Df at points along the segment, applied to g — p. m 


11. AFTERWORD. We believe that this development, which is in the construc- 
tivist manner of Errett Bishop and L. E. J. Brouwer [4], produces proofs that are 
shorter and more transparent than those encountered in classical treatments. The 
idea of working with uniform rather than pointwise notions is a hallmark of the 
constructivist tradition. 

For the one-dimensional case, our definition of differentiable function is a 
uniform version of a definition of Carathéodory. See [3] and the references 
therein. For a definition of this kind in higher dimensions, see [1]. 
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The Answer is 2” - 2! What’s the Question? 


Gary Gordon 


1. INTRODUCTION. Allison, Cory, Heidi, and Zach are relaxing in the math 
coffee room. Allison says, “Did you see Jeopardy last night? I’m always excited 
when they have Mathematics as a category, but their math questions are always so 
lame!” Cory and Zach agree with Allison. Heidi says “Why don’t they ever have an 
answer like 2”!? Then the contestants would have to find a problem with 2”n! as 
the answer. That might not be so easy, even for Alex.” 

Cory says “I’ve got it: What do you get when you multiply 2” and n!?” 

Ignoring Cory’s correct but uninteresting solution, Allison continues, “Since 
we’re mathematicians, our goal should be to find really good questions, using all 
sorts of math. We should each find our own question, one that uses our own field. 

Remember, we’re trying to find good questions having 2”n! as the answer.” 


The rest of this paper is devoted to the approaches taken by these four 
(fictitious, but aptly named) mathematicians. These approaches are from three 
different fields: algebra, geometry and combinatorics. This allows us to see the 
same problem from four different viewpoints, with each successive explanation 
encompassing the previous ones. 

The algebraic and geometric approaches given here are well known, though 
perhaps not as widely known as they should be. The combinatorial approach is 
recent, incorporating a graph theory algorithm to produce the desired connections 
among these viewpoints. 

In the hope of making this note more readable, the proofs are frequently 
sketched broadly. We hope enough details remain for the interested reader to fill 
in any missing steps (or look them up). 


2. ALLISON’S TURN. Allison’s specialty is algebra; she decides to find a group 
with 2”n! elements. She needn’t look far; the full symmetry group of an n-dimen- 
sional hypercube will do. There are n? mirror symmetry hyperplanes for an 
n-dimensional hypercube. One way to see this is to place the 2” vertices of the 
hypercube at the points (+1,...,+ 1). Then the reflections of the hypercube 
occur in the hyperplanes whose equations are: 


¢ x; = 0 (the n coordinate hyperplanes), 
¢ x; =x, (there are ("] of these), and 


¢ x; = —x, (there are ("] of these, too), 
where 1 <i <j <n. See Figure 1 for a picture of the cube and its nine planes of 
reflection. In this figure, x, = 0 (for 1 <i < 3) corresponds to a plane that slices 
through the middle of four faces of the cube, while x; =x, and x; = —x, @ #j) 
correspond to planes that contain the diagonals of two opposite faces and two 
edges of the cube. 
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Figure 1. A cube and its planes of symmetry. 


Let H,, denote the n-dimensional hypercube and let G,, be its symmetry group. 
Allison needs to show there are precisely 2”! elements in G,,. She decides to give 
an inductive argument. When n = 1, the hypercube is just a line segment, which 
has symmetry group Z,. In general, H, has 2n facets (faces of dimension n — 1, 
each of which is the hypercube H,,_,). Allison constructs an arbitrary symmetry of 
H,, as follows: 


1. Choose one of the 2 facets of H, and move the hypercube so that this facet 
is on the bottom; the bottom of the hypercube is the facet that meets the 
x,-axis at the point (0,...,0, —1). 

2. Use an element of G,_, as a symmetry of this bottom face; in general, this 
symmetry permutes the other facets of H,. 


In 3 dimensions, this corresponds to picking one of the 6 faces of the cube for 
the bottom, then using one of the 8 symmetries of the square on this bottom face. 
This gives 48 symmetries, only 24 of which can be realized by rigid motions. The 
remaining symmetries are either reflections (there are 9, as we have already seen) 
or rotary reflections, i.e., reflections followed by rotations (there are evidently 15 of 
these). It is an interesting exercise to find explicit descriptions of these 15 rotary 
reflections for the cube. 

Every symmetry of H, can be obtained in this manner, since any symmetry must 
carry facets to facets. Then |G,| = 2n|G,_,|, which, together with the initial 
condition |G,| = 2, gives |G,,| = 2”n! 


Allison’s algebraic solution. Let G, be the symmetry group of an n-dimensional 
hypercube. Then the order of G,, equals 2”n! 


This approach gives us our answer in one way, but gives us very little informa- 
tion about the structure of the group G,. We investigate the group in a bit more 
detail in the concluding section. 


3. HEID?’S TURN. Hyperplane arrangements form the background for the follow- 
ing classic problem, a favorite in mathematics contests: 


¢ What is the largest number of regions produced when n lines are drawn in the 
plane? 
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This problem is also useful when introducing mathematical induction. The 
3-dimensional version appeared as MONTHLY Problem E554 [8], where J. L. 
Woodbridge of Philadelphia asked: 


¢ Show that n cuts can divide a cheese into as many as (n + 1)(n? — n + 6)/6 
pieces. 


Both of these questions are answered by a general formula discovered by L. 
Schlafli (published posthumously in 1901): 


e The largest number of regions produced when n hyperplanes are drawn in 
d-dimensional space equals a), 


Counting the regions of various hyperplane arrangements is the beginning of a 
beautiful subject, with deep ties to algebra, topology, and combinatorics. A classic 
reference is [7]. Heidi has studied hyperplane arrangements and thinks she can use 
them to construct a good question. Since the algebraic approach has a strong 
geometric flavor, her first solution is to copy Allison’s solution, without using 
groups. Recall that Allison used hyperplanes to understand the group G.,. If Heidi 
«simply uses the same collection of hyperplanes Allison used (without even men- 
tioning the hypercube), she will get a dissection of space into open n-dimensional 
regions. How many regions are produced by this hyperplane arrangement? 

Thus, Heidi is concerned with counting the regions of the hyperplane arrange- 
ment whose equations are x; = 0, x; =x,;, and x; = —x;. Heidi needs to show that 
the number of regions is 2”n!. 

What do these regions look like? Heidi finds it is easier to imagine the regions 
by intersecting them with a cube, as in Figure 1. Then a typical region is formed as 
follows: Let O = (0,0, 0) be the center of the cube, let P, = (1, 0,0) be the center 
of a face, let P, = (1,1,0) be the center of an edge adjacent to this face, and 
finally let P; = (1, 1,1) be a vertex adjacent to this edge. Then the region is the 
tetrahedron whose vertices are the four points O, P;, P,, and P,. There 6 choices 
for P,, 4 choices for P,, and 2 choices for P;, giving us 48 regions. 

In general, Heidi produces a region by picking the center of the hypercube, then 
picking the center of one of its 2” facets, then picking one of the 2(n — 1) faces (of 
dimension n — 2) of the chosen facet, and so on. At stage k, she chooses the 
center of one of the 2(n — k) facets surrounding an n — k-dimensional face of the 
hyperplane. The n + 1 points produced are the vertices of a simplex; this simplex 
is the intersection of a region of the hyperplane arrangement with the hypercube. 
See [2, §7.6] for more on this approach. 

The number of regions of the arrangement is just 2m -2(m — 1)-+4-2 = 2n!, 
so Heidi proudly announces 


Heidi’s hyperplane solution. Let A, be the hyperplane arrangement given above. 
Then A, decomposes space into 2”n! regions of dimension n. 


Heidi also wants to relate her approach to Allison’s approach. She does so by 
exhibiting a one-to-one correspondence between the regions of the arrangement 
and the elements of the symmetry group G,,. To see this correspondence, first note 
that the symmetry group G,, acts on the regions of the arrangement. Now given any 
pair of regions A and B in her hyperplane arrangement, there is a unique element 
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O43, 6G, with o, ,(A) =B (G, acts transitively on the arrangement). The 
existence of o, , follows because the regions themselves have no non-trivial 
symmetries—they are completely asymmetric. Then labeling an arbitrary region of 
the arrangement by the identity J of the group, Heidi gets her bijective correspon- 
dence between the regions A,, A,,... and the elements o; 4,0), 4,,... Heidi has 
constructed a portion of the Cayley graph of G,—the arrangement itself is 
equivalent to the geometric dual of the Cayley graph. See [2] for more on Cayley 
graphs. 


4. ZACH’S TURN. If M is any r Xk matrix, then the zonotope Z(M) is a 
polytope in &’ given by 


k 
= 


where c, is the i column of M. Since the c, are just an arbitrary collection of 


vectors in &’, the study of zonotopes is very natural in geometry or linear algebra. 
It is also fun to build 3-dimensional models of them using a good kit; Polydrons 
and Zometools are both good choices. 

The hypercube H, is a product of n intervals: H, =[—1,1] x --- x [-1, 1]. 
Zonotopes are projections of hypercubes; the definition shows how each interval 
[—1,1] appears. The name zonotope arises from the fact that the facets determine 
‘zones’ in space. Zonotopes are an important and well-studied class of polytopes 
with applications to oriented matroids, tiling problems and more. See [1], [2], and 
[14] for interesting connections among zonotopes, groups, geometry and oriented 
matroids. 

Zach knows zonotopes; his idea is to find a class of zonotopes in which 2”n! 
counts something. An obvious choice for a matrix whose columns generate the 
zonotope (given the first two solutions) is the matrix of normal vectors of the 
hyperplanes considered previously; x; = 0, x; =x,;, x; = —x,. For n = 3, a picture 
of the zonotope appears at the bottom of Figure 2. This polytope is a rhombitrun- 
cated cuboctahedron, one of the Archimedean solids. This zonotope arises as a 
truncation of a cube; the vertices of the cube correspond to the hexagons of the 
zonotope, the edges of the cube correspond to the squares and the faces (squares) 
of the cube correspond to the octagons. See Figure 2 for an illustration of how the 
truncation process produces the zonotope. 

Zach constructs the n X n*? matrix M, whose column vectors are normal to the 
hyperplanes (yes, the same hyperplanes Heidi used), and he calls the associated 
zonotope Z(M,,). In general, order the columns of M,, as in the following example: 


1 0 oO 414 1 1 1 +0 0 
M,=|0 1 O 1 -1 0 060 1 1 
, G2 & © O@ Det & 


Zach now just counts the number of vertices of the zonotope Z(M,,). Superim- 
posing a picture of Z(M,) and the hyperplane arrangement A, Zach realizes that 
there is a one-to-one correspondence between the vertices of Z(M,) and the 
regions of the arrangement (see Figure 3). Note that each vertex of Z(M,) appears 
in the center of a region of the arrangement. Generalizing from 3-dimensions to 
n-dimensions (frequently dangerous but also frequently productive), he guesses 
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Figure 3. A zonotope and its planes of symmetry. 


Zach’s zonotope solution. The zonotope Z(M_,,) has 2”n! vertices. 

To see why this is true in general, Zach uses a little linear algebra. Here is a 
sketch of his ideas: First, he realizes that if the vector v in &” is a vertex of the 
zonotope, then v = Mp, where p is a column vector, each of whose n” entries 
equals 1 or —1. This follows from remembering that Z(M,) = X.,A,c,, where 
—1 <A, < 1;a vertex of Z(M,) can ensue only when A, = 1 or —1. There are 2” 
such +1 vectors p, representing the vertices of an n*-dimensional hypercube, the 
hypercube that is projected to-Z(M,). 
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Each of these 2” vertices of the hypercube produces a vector v that is now a 
possible vertex of Z(M_,). Which v are the zonotope vertices? Zach figures out that 
any vector whose entries are some permutation of {la,,3a,,5a;,...,(@2n — 1)a,}, 
where each a, equals 1 or —1, arises as Mp for some vector of +1’s. 

Why do the vertices look like this? Every row of the matrix M, has exactly 
2n — 1 non-zero entries. Thus any coordinate of Mp is bounded above by 2n — 1 
and below by —(2n — 1). Once p has been chosen to make a certain coordinate of 
M,,p equal to +(2n — 1), the next largest (or smallest) coordinate value possible is 
+(2n — 3). The argument proceeds inductively; see [3] for a complete proof. 

Conversely, each such v must be a vertex of the zonotope, since no such vector 
is a convex combination of other points in Z(M,,). In particular, the matrix product 
M,p can never produce any entry larger than 2m — 1 in absolute value. Thus, if 
+(2n — 1) appears as an entry in v, then v could not be a non-trivial convex 
combination of other vertices of the zonotope. Finally, no other v is a vertex of 
Z(M_,), since no other v contains +(2n — 1) as an entry. 

The key for us is simply the following: since there are n! permutations of 
[1,3,...,(2n — 1)] and 2” possible ways to assign +1 to each entry, Zach has 
produced 2”n!, as required. 


5. CORY’S TURN. Zach did his job, but he ducked an important question in 
considering the matrix products M, p: 

Question 1. Of the 2” possible sign vectors p, which ones produce the 2”! 
vertices of the zonotope Z(M,,)? 


The job of tying all the pieces together falls to Cory, our combinatorialist. Cory 
realizes that finding a question whose answer is 2”n! is not much of a challenge; he 
could just find all permutations of [+1, + 3,..., + (2n — 1)] (as in Section 4) and 
be done. This involves combinatorics only superficially, however; he seeks a deeper 
connection, as do we. 

The matrix M, that Zach used reminds Cory of incidence matrices. Remember 
that every column of M,, has either exactly one non-zero entry (which equals 1) or 
exactly two non-zero entries (which either are both equal to 1 or have one entry 
equal to 1 and the other equal to — 1). Cory decides to create a graph that has this 
matrix as its vertex-edge incidence matrix. Here’s how the graph is defined: First, 
he labels n vertices of the graph he (modestly) calls C,, with the numbers 1,..., 7 
(corresponding to the rows of the matrix). Then the edges of the graph are filled in 
as follows: 


1. Put a loop at every vertex (corresponding to the columns having one 1 and 
n — 1 0's), 

2. Put an edge between every pair of vertices (corresponding to the columns 
having two 1’s and n — 2 0’s), 

3. For each i and j such that 1 <i <j <n, put a directed edge pointing from 
vertex i to vertex j (corresponding to the column having a 1 in position /, 
a —1 in position j and n — 2 0’s). 


Technically, C,, is a mixed graph because it mixes directed and undirected edges. 

Cory has talked to Zach and knows about the connection between the matrix 
M,, and the zonotope Z(M,,). In order to connect the graph C,, with the zonotope, 
he needs to interpret Mp, where p = [e,,...,@,2]’ and each e, = +1. He decides 


to use the entries of p to label some of the edges of C,, with +1. If column &k of 
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M,, corresponds to an undirected edge in C,,, Cory labels this edge with e,, which 
equals either 1 or —1. Similarly, if column & corresponds to a loop in C,, he labels 
the loop with e,. Finally, if column k of M, corresponds to a directed edge from 
vertex 2 to vertex j, he leaves this directed edge unchanged if e, = 1 and he 
reverses the direction if e, = —1. 

This process produces a signed, oriented mixed graph C,(p). See Figure 4 for an 
example with M, and the vector p = [1, —1)1, —1,1,1, —1,1, —1]’. To under- 
stand how this p produces the signs and arrows shown, recall the order of the 


Figure 4. Signed mixed graph G,(p). 


normal vectors appearing as the columns of M,: The first three columns corre- 
spond to the loops at vertices 1, 2, and 3. Columns 4, 6, and 8 correspond to the 
(undirected) edges joining vertices 1 and 2, 1 and 3, and 2 and 3, respectively. 
Thus, for example, since e, = —1, the edge joining vertices 1 and 2 is signed with 
—1. Columns 5, 7, and 9 correspond to the directed edges joining vertices 1 and 2, 
1 and 3, and 2 and 3, respectively. Thus, for example, since e, = —1 in the 
example, the edge directed from vertex 1 to vertex 3 is reversed. 

There is a one-to-one correspondence between signed, oriented mixed graphs 
and the sign vectors p. It is interesting to note that these sign vectors also arise 
naturally in the context of the hyperplane arrangements—Heidi’s hyperplane 
approach to the problem. For any region R in the hyperplane arrangement, let v 
be a point in R. Thinking of v as a vector, compute the usual inner product of v 
with each of the n* normal vectors. Recording only whether these inner products 
are positive or negative produces a sign vector of length n’; this simply records 
which side of each hyperplane the point v lies. Among all possible his potential 
sign vectors that could arise in this way, it turns out that only 2”n! do arise. These 
are precisely the sign vectors that produce vertices of the associated zonotope. 

As anyone who has ever put up wallpaper knows, it is easy to move a bubble 
beneath the paper from place to place, but it’s hard to get rid of the bubble. Cory 
has now shifted the problem of determining the 2”n! vertices of the zonotope (or 
the regions of the hyperplane arrangement) to a graph theory problem. 


Question 2. Of the 2” possible oriented, signed mixed graphs C,(p), which ones 
produce the 2”n! vertices of the zonotope Z(M,,)? 


Cory’s goal is to separate the two factors 2” and n! by having each one count a 
particular action. Now there are two aspects to C,(p): the orientation of the 
directed edges and the signing of the undirected edges. Cory decides to call a sign 
vector p good if it corresponds to a vertex of the zonotope. Cory notices a few 
things about these good p: First, the orientation of C,(p) is acyclic when p is good. 
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This means that the vertices can be linearly ordered v,,v,,...,u, so that every 
directed edge adjacent to v, points away from u,, every directed edge adjacent to 
v, (except for the edge from v, to v,) points away from v,, and so on. In the 
language of tournaments, the directed edges of C,,(p) form a transitive tournament 
—if a beats b and b beats c, then a beats c. There are n! acyclic 
orientations—Cory is half-way home. 

The second important point Cory realizes has to do with the signs on the 
undirected edges. When p is good, these signs can be obtained ‘vertex by vertex’ as 
follows: Assume v,,v,,...,v, iS the linear order and let b = [b,,...,5,]' be one 
of the 2” vectors of 1’s and —1’s. If 5, = 1, then he signs with a 1 all undirected 
edges incident to v, (including the loop); if b, = —1, then all undirected edges 
(including the loop) incident to v, are signed with a —1. Now Cory removes uv, or 
v, from the list (depending on whether b, = 1 or —1), giving a new list of n — 1 
vertices. He then repeats the process for b,: if b, = 1, then he signs with a 1 all 
undirected edges incident to the first vertex on the new list (v, when b, = —1 and 
v, when b, = 1) Gncluding the loop) that have not previously been signed; if 
b, = —1, then he signs with a —1 all undirected edges (including the loop) 
incident to the last vertex on the list (v, when b, = 1 and v,_, when b, = —1) 
that have not previously been signed. The process continues until all of the signs 
b, — b, have been processed. Since there are 2” sign vectors b and each one gives 
a unique signing, Cory has the 2” factor directly visible, too. 

For example, if b = [1, —1,1]’ and v, = 3,v, = 1, and v; = 2, he first paints 
the edges adjacent to vertex 3 with 1’s, then paints the previously unsigned edges 
adjacent to 2 with —1’s and finally paints the remaining unsigned edges adjacent 
to 1 with a 1; the only edge painted in this last step is the loop at vertex 1. See 
Figure 4. 

Cory gives an inductive argument to show why this works; you can see it in [3]. 
The key for us is that this procedure for producing an oriented signed graph gives 
a direct link to our 2”n! problem family: Since there are n! orderings of the n 
vertices (to produce an acyclic orientation) and there are 2” sign vectors b of 
length n, Cory immediately gets the answer of 2”n!. 


Cory’s combinatorial solution. There are precisely 2”n! oriented, signed mixed 
graphs C,(p) corresponding to good sign vectors p. 


6. REBUTTAL. Allison, Heidi, Zach, and Cory meet in the coffee room to share 
their work. Allison sees a connection between the structure of the group G, and 
the ubiquitous 2”n! 


Allison’s turn. Allison knows that the symmetry group G,, of a hypercube decom- 
poses as a Semi-direct product: 


G, = 2, %58,. 
She views the symmetry group of a hypercube as follows: 


¢ First label the 2n facets of the hypercube by the symbols 1, 1*, 2,2*,...,, n*, 
where the symbol i represents the facet contained in the hyperplane x, = 1 
and i* represents the facet contained in the hyperplane x, = —1. 

¢ Choose a vertex v of the hypercube and record the ordered list of n facets 
incident to v. The starting point for the list can be determined uniquely by 
choosing the top or bottom facet first (whichever is incident to v) and fixing 
an orientation in space- 
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e Then an arbitrary symmetry of the hypercube can be broken down into two 
steps: First choose a permutation of the labels around the vertex v, then map 
v to some other vertex vu’. 


Allison explains to her colleagues: “The first step in this procedure can be 
accomplished by composing certain reflections through v (the collection of all 
reflections through v generates the symmetric group S,,), while the second step 
amounts to choosing an element from the normal subgroup Z5 to move from v to 
v' (accomplished by conjugating the permutation by the appropriate element of Z5, 
which is generated by the reflections perpendicular to the coordinate axes). Hey! Is 
anyone still awake?” 


Cory’s turn. Cory sees that the graph algorithm that produces the good sign vector 
p does (to a graph) more or less the same thing Allison just did (to the group G,,). 
To understand how the correspondence works, Cory reminds his colleagues that he 
first chooses an acyclic orientation of C,,, resulting in a permutation of the n 
vertices (which corresponds to the permutation of the n facets around the vertex 
v), then he picks a sign vector b of length n (which corresponds to mapping the 
- vertex v in the hypercube to some other vertex). This gives a map of the 2”n! 
elements of G,, to the collection of all C,(p) that are formed from good p’s. 
Heidi adds, “It’s really interesting how the four approaches used different areas 
of mathematics, but are so closely related. Let’s send our solutions to Jeopardy!” 


7. CONCLUDING REMARKS. The connections between hyperplane arrange- 
ments, zonotopes, and symmetry groups is explored in [2]. Further interpretations 
are examined in [5]. The sign vectors p considered here are maximal covectors in 
an associated oriented. matroid [1]. The further connection with acyclic orienta- 
tions of ordinary graphs is due to Curtis Greene [4], while the extension to mixed 
graphs appears in [3]. 

Signed graphs also provide a good way to understand the combinatorics of 
hyperplane arrangements. Tom Zaslavsky has developed a substantial theory for 
these combinatorial objects. See [11], [12], or [13] for a sample of this work. Many 
of his results generalize to other hyperplane arrangements. Zaslavsky has also 
considered hyperplane arrangements from a matroidal viewpoint. A very readable 
account appears in [10]. 

Many of the arguments given here can be rephrased using matrix groups; the 
reflections in the group G,, are easily represented by matrices. See [2] for more 
details on using linear algebra in this way. 


ACKNOWLEDGMENT. Liz McMahon’s comments substantially improved the exposition. Figures 1, 2, 
and 3 were created by the computer program KaleidoTile. The author also thanks Allison, Heidi, Cory, 
and Zach for their names. 
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Conditional Convergence of Infinite Products 


William F. Trench 


In this article we revisit the classical subject of infinite products. For standard 
definitions and theorems on this subject see [1] or almost any textbook on complex 
analysis. We will restate parts of this material required to set the stage for our 
results, as follows. 

The infinite product P = []*(1 + a,) of complex numbers is said to converge if 
there is an integer N such that 1 +a, #0 for n = WN and lim, ,..[1"_,( + 4,,) 
is finite and nonzero. This occurs if and only if the series L* _, log(1 + a,,) 
converges. 

We say that P converges absolutely if [1° + |a,|) converges. If P converges 
absolutely then P converges, but the converse is false. The following theorem 

[1, p. 223] settles the question of absolute convergence of infinite products. 


Theorem 1. The infinite product [1°(1 + a,) converges absolutely if and only if 
la, | < &. 


If P converges but II°( + |a,|) does not, then we say that P converges 
conditionally. Conditional convergence of )*a,, does not imply conditional conver- 
gence of P. The following theorem [1, p. 225] seems to be the only general result 
along these lines, at least in the textbook literature. 


Theorem 2. If ¥*|a,|* < © then Xa, and T1*(1 + a,,) converge or diverge together. 


Here we offer some other results concerning convergence of infinite products. 
Because of Theorem 1, these results are of interest only in the case where 
La, | = ©. 


Theorem 3. /f there is a sequence {r,,} such that 


lim r, = 1 (1) 


| fn) 


and 

YIr,(1 te Ay Ped | SOO, (2) 
then TI*(1 + a,,) converges. 
Proof: Let g, =r,(1 + 4,) — 7,41. Then 


¥ |g.) < (3) 


from (2), so lim, _,..g,, = 0 and therefore lim, _,.,a, = 0 by (1). Choose N so that 
r,,1 +a, and1+g,/r,,, are nonzero if n => N. Now define p,_, = 1 and 


Pn = I] G+a,,), n=N. 
m=N 
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If n=>N then 1+ a4, =p,/p,-1, 80 &) =U pPa/Pn-1) —%n+1> and therefore 
Pn ='n+1 Pn + 8n/ln+1)/l,> Which implies that 


(1 Tue) Pay (4) 
N 


n 
Pati 


Pn =~ 


Since (1) and (3) imply that X”|g,, /r,,,,| < ©, Theorem 1 implies that the infinite 
product " 


QO = Ia a Bi/ Vint ) 


converges; moreover Q # 0 because 1+ g,,/r,,,, #0 if m = N. Now (1) and (4) 
imply that lim, ,..p, = Q/ry is finite and nonzero. Ss 


To apply this theorem we must exhibit a sequence {r,} that will enable us to 
obtain results even if *|a,,| = %. The following theorem provides a way to do this. 


Theorem 4. Suppose that for some positive integer q the sequences 


a) 


a) = Yaa, k=1,...,q (with a® = 1), 
m=n 
are all defined, and 


yo la,a,? |<. (5) 
Then TI*(1 + a,) converges. 


Proof: Define 
k 
rh) =1+ Yi (-1)laY, 1<k <q. 
jel 


We show by finite induction on k that 

ROC + an) — Os = (—1)"a,aP (6) 
for 1 < k <q. Since lim, _,,,r6” = 1 we can then set k = qg and conclude from (5) 
and Theorem 3 with r, = r‘ that TI°( + a,) converges. 


Since r“? = 1 — a the left side of (6) with k = 1 is 


(1 = a? )(1 a a,) — (1 = aii) oe ay, — a i a,a? a Se = —a a 


since a!!}, + a, = a‘). This proves (6) for k = 1. 
Now suppose that (6) holds if 1 < k <q — 1. Since r = r&*) 4 (-1)katD, 
(6) implies that 


k k k 
Geely ea) a a) aaa = 1) ae 


An 


Therefore 
k+1 k+1) _ k k k+1 k+1 k+] 
ry 6! + a,) = ei = (a) (a,a? — af — 4,4, a) 
= pa qk +) 


nAvn ) 


since a¥&*) + a,a® = at). This completes the induction. a 


We now prepare for a specific application of Theorem 4. Henceforth A is the 
forward difference operator; thus, if {g,,} is a sequence, then Ag, = 2,41 — 2m 
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while if G is a function of the continuous variable x then AG(x) = G(x + 1) - 
G(x). Higher order forward differences are defined inductively; thus, if v > 2 is an 
integer, then 


A'S = A181 — AB = (-1)" (Bae 
r=0 


A similar definition yields A’G(x). 


Lemma 1. Suppose that t is a real number, not an integral multiple of 27, and 
{Smm—o #8 a sequence such that lim,, -, 08m = 0 and 


| A’e, | < © (7) 


imt 


for some positive integer v. Then L°g,,e'”' converges and 


oa) p-1 ore) 
2 ee — (1 = e') i 3 Ag. 4 eivt 2 (A’g,, em" (8) 
m=0 s=0 m=0 
where 
yv—-1l 
= ee | 4 imt ou 
A, 2 | 1) jaan , Ce wa p= 1. (9) 


Proof: Suppose that M > 2» and let 
M 
Sy=(1—e)” D ene (10) 
m=0 
Since 
(1 _ eit)" eimt = py (-1)'{ Y lekm, 
r=0 


we have 


I 
Los 


v v M 
Sx Zn y- (—1)'[ Jeter _ > (-1)'(”) y; ee iia 
0 r=0 r=0 m=0 


I 
M< 
_~ 

| 

—_— 
we” 
~ 
—————. 
<= 
Se 
M4 
OE 
3 
a 
= 


Reversing the order of summation in the last sum yields 


sy E(E(-0 (Peale + E (E02) bee 


m=0 \r=0 : m=v \r=0 


M+yv v 

+E [ E (-17(%) em] 
m=M+1 \r=m-M x 

Since lim,, ,.2,, = 0 the last sum on the right converges to 0 as M— ~. The 

second sum on the right is 


M . . M-v . 
y) ( A” Re — eivl > (A’g,, er”, 
m=v m=0 
which converges as M — ~ because of (7). Therefore 


v—1l m ee 
im Sy = 5 = D (E(-1)'(7} meen +e E (arene 


M~ WO =O e m=O 
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which can also be written as 


v—1 
S= > A,g, ale ei¥t y (A’g,, )e™, 
s=0 m=0 
with A, as in (9). This and (10) imply (8). a 


Henceforth we write G(x) = O(x~*) to indicate that x*G(x) remains bounded 
as X > ©, 


Definition 1. Let ¥, be the set of infinitely differentiable functions F on [1,%) 
such that 


FO(x) = O(x"*”), v=0,1,... (11) 


For example, let F(x) = u(x), where u is a rational function with positive 
values on [1, ©) and a zero of order p > 0 at ~; then F satisfies (11) with a = py. 
To see this, we first recall that if f = f(u) and u = u(x), the formula of Faa di 
Bruno [2] for the derivatives of a —" function says that 


sohlu(x)) = De ik) en er (4) (FS) (2). (12) 


where the prime denotes differentiation with respect to x. We are assuming here 


that the derivatives on the right of (12) exist. Here u,..., u°”? are evaluated at x, 
and »). is over all partitions of r as a sum of nonnegative integers, 
CP af, — (13) 
such that 
hy 2h a (14) 


Applying (12) with f(u) = u” yields 
v ! ul(x)\"'(u"(x)\" cre) \ 
My) = yr ae - 
re Boe ae) (ae) (Se) 


pv} 
where(y)” = y(y — De (y —r + D. Since u(x) = O(x-?~'), it follows that 
(ur (x))(u'(x)) (u(x)? (U(x) = OC), 
where 
N= Dp) (por lier (per 2) i ee Pee) 6 py P 
because of (13) and (14). This verifies (11) with a = Py. 


For our purposes it is important to note that ¥, is a vector space over the 


complex numbers. Moreover, if F; © %,,i= 1,2, then FF, © F ,,.. 


Lemma 2. Jf F © F, then 
A’F(x) =O(x-*”), v=0,1,2,... 


Proof: We show that 
JA’F(x)|<K max |F™(é)|, (15) 
X<E<X+D 


where K is a constant independent of F. Since F(x) = O(x-*~”) this implies 
the conclusion. 
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To verify (15), we note that if x > 1 and r > 0 then Taylor’s theorem implies 
that 
v-1 FM y FO E, 
F(xt+r)= )v ello = ie) 


m= 


where x < &€ <x +r. Since A’F(x) = X?_)(- ry(*) PG + r), it follows that 


? 


m! yp! 


v—-1 Fo os 1 v — 
ign oS PG ye? i Hho ilganeo) 
m=0 ‘ ~=0 
Since L?_)(—-1)"" (he = 0 for m=0,. — 1, we can now infer (15) with 
= (Dro )r")/21. ™ 


Lemma 3. Suppose that F € ¥,. Let v be a fixed positive integer and let t be a real 
number, not an integral multiple of 27. Then 


>, F(m)e'™ ear G(n)e™ a One" y: 


where G & ¥, (and G depends upon v). 


Proof: We write 


y) F(myei™ =e YY F(n+mye™, (16) 
m=n m=0 

From Lemma 2, A’F(n + m) = O(n + m)~°*~” ); that is, there is a constant A 

such that |A’F(n + m)| <A(n +m)-°-” if n +m > 0. Therefore, if n > 2, 


= 1 n+m dx 
y WF(n +m) <A DY Oo =o <A Df 


m=0 m=0 “ntm- 1(x+a) 


00 dx 
J - SOC er) 
n—-1 (x i a) 
Applying Lemma 1 (specifically, (8)) with g,, = F(m + m) and n fixed shows that 


3 F(n + mje" = G(n) + O(n-*""*") 


m=0 
with 
Sead p= | 
G(x) = (1 -e”) LAE +s), 
=0 
so G € ¥,. Now (16) implies the seaciuion. i 


The following theorem shows that Theorem 4 has nontrivial applications for 
every positive integer q. 


Theorem 5. Suppose that 


a, =f(nje"’, n=1,2,3,..., (17) 
where f © F, for some y € (0, 1], and let q be the smallest integer such that 
(q+1)y>1. (18) 


Then the infinite product P = [1° + a,,) converges if @ is not of the form 2ka/r 
with k an integer andr € {1,..., q}. 
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Proof: We show by finite induction on p that if p = 1,...,q then 
aa? =f, (njePFD"® 4 O(n (PtDy-a4P ) (19) 


where f, © %,,,. In particular, (19) with p=q implies that a,a\? = 
O(n 977), so (18) implies (5) and P converges, by Theorem 4. 
From (17) and Lemma 3 with t = 6, F = f a= y,and v=q, 


qa) _ E f(mjyer" = G(n)ei"® alt O(n77-49*! ), 
with G, ©.%. Therefore aa) = = f(nye'""(G (ne? + O(n-*4*" )). Since f € F 


this can be rewritten as a,a\ = fi (nje7"® + O(n-*7-4*"), with f, = fG, © 4H, 


ee ti 


This establishes (19) with p = 1, so we are finished if g = 1. 

Now suppose that g > 1 and (19) holds if 1 <p <q. Since (p+ 1)@ is by 
assumption not an integral multiple of 27, Lemma 3 with t= (p + 1)6, F=f, 
a=(pt Dy, and vy = q — p implies that 


y ie (m) elt De = G ene C20 she O(n7 (pt+l)y- Cel). 


m=n 


where G, © &,,,1),. This and (19) imply that 
at) = i a,,a a?) = =G pene eee fe O(n (pt+l)y-qtptl ); 


SO 


nAvn 


a gt) = f(nye'"*(G, (n)eiPt dno shi O(n rae )). 


Since f © &, this can be rewritten as 


a,aPrh = Pune eo ae O(n (Pt 4tP +t ), 
with f,4, =fG, © A,+2),- This completes the induction. ai 


Corollary 1. Suppose that {a,}° is as defined in Theorem 5. Then the infinite product 
[1° + a,) converges if 6 is not a rational multiple of 27. 


Corollary 2. Suppose that a>0O0 and R is a rational function such that 
st > 0 on [N,~) (N = integer) and lim, _,,,.R(x) = 0. Then the infinite product 
TI?_y( + (R(n))%e'"") converges if 6 is not a rational multiple of 27. 


Corollary 3. The infinite product 11°. + n-“e'"®) converges if a > 0 and 0 is not a 
rational multiple of 277. 
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H. J. S. Smith and the Fermat Two 
Squares Theorem 


F. W. Clarke, W. N. Everitt, L. L. Littlejohn, and S. J. R. Vorster 


This article is dedicated to Professor P. R. Halmos 


1. INTRODUCTION. In his remarkable book <A Mathematician’s Apology, 
G. H. Hardy wrote [12, p. 97] 


Another famous and beautiful theorem is Fermat’s ‘two square’ theorem. The 
primes may (if we ignore the special prime 2) be arranged in two classes; the 
primes 


55135107, 29537 Al esiga 
which leave remainder 1 when divided by 4, and the primes 
35 Teddy A923 531 yates 
which leave remainder 3. All the primes of the first class, and none of the 
second, can be expressed as the sum of two squares: thus 
5=174+27, 13=274 3° 
17=1°+ 47, 29=27 + 5? 
but 3, 7, 11 and 19 are not expressible in this way (as the reader may check by 
trial). This is Fermat’s theorem, which is ranked, very justly, as one of the 


finest of arithmetic. Unfortunately there is no proof within the comprehen- 
sion of anybody but a fairly expert mathematician. 


The history of this theorem of Fermat is given in detail by Dickson [7, 224-237]. 
Dickson names the theorem after Girard, who discussed the result in 1632; 
however the common practice now is to attribute the result to Fermat, who stated 
in 1659 that he possessed an irrefutable proof by the method of infinite descent; 
see [7, p. 228] and [2, p. 89]. The first recorded proof is due to Euler given in 1749 
[7, pp. 230-231]; Bell writes, “It was first proved by the great Euler in 1749 after 
he had struggled, off and on, for seven years to find a proof” [2, p.89]. The first 
proof that such prime numbers can be uniquely represented as the sum of squares 
of two positive integers was given by Gauss in 1801 [7, p. 233]. See also the account 
of the two squares theorem of Fermat in the books by Burton [4, Chapter 12, 
Section 2], and Hardy and Wright [14, Chapter XX]. 

The last sentence in the quotation from Hardy is significant. Hardy had an 
interest in the classification of proof; see, in particular, [13, p. 6] in connection with 
the “elementary” proof of inequalities. In this context the word elementary must 
not be confused with the words obvious or easy; many of the elementary proofs in 
[13] are subtle, ingenious, and far from obvious. When Hardy wrote [12] he was, 
more than likely, not aware that an elementary proof of this theorem of Fermat 
had been given in 1855 by H. J. S. Smith, one of his predecessors in the Savilian 
Chair of Geometry in the University of Oxford. This simple but remarkable proof 
of Smith is within the comprehension of those with knowledge of elementary 
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algebra, including simple properties of determinants, and the fundamental theo- 
rem of arithmetic [6, Chapter I, Section 4]. The proof is also remarkable for giving 
a construction that permits one to compute the integers of the two squares 
representation. 

In this paper we give Smith’s proof of the theorem of Fermat and present what 
is, possibly, a new elementary proof of the uniqueness of the two squares represen- 
tation, but now using Smith’s ideas and method. This uniqueness proof involves the 
Euler Criterion [8, Section 11] for solutions of the quadratic equation a | 
(mod p); we present a new existence proof that leads to a constructable solution 
of this equation. 

The original paper of Smith [20] is (the good news) only 2 pages long but is (the 
bad news for most of us) written in Latin; see also the collected works of Smith 
[21], in which [20] appears as the second contribution. The Smith proof has not 
gone entirely without notice; Chrystal [5, p. 471] reproduces the proof in English, 
as does, in part, Dickson [7, pp. 240-241]; Davenport mentions the proof [6, p. 122] 
but does not give complete details. Barnes [1] gives an exposition of Smith’s 
existence theorem, and establishes the connection between the Smith palindromic 
continuant and the Euler Criterion (see our Theorems 1 and 2 and their proofs). 

Both Serret [19] and Hermite [17] use ideas similar to the Smith method [20] to 
give an algorithm for finding the integers in the two squares representation of the 
theorem of Fermat. This method was subsequently improved by Brillhart [3] to give 
an impressively fast numerical procedure to determine the representation; as an 
example the Brillhart method gives 

10°? + 577 = 7611065343808354245450401* + 6486268906873921642245424?. 

(1.1) 
The two squares theorem of Fermat continues to attract attention; see the recent 
contributions by Ewell [9], Heath-Brown [16], Wagon [22], and Zagier [23]. 

In Section 2 we give formal statements of the results to be proved by the Smith 
methods. In Section 3 we give a brief account of the life of Henry Smith. In Section 
4 there is a definition and statement of the properties of continuants. The 
remaining sections are devoted to proofs of the results. Lastly, in an appendix, we 
reproduce the original Smith paper [20]. 

At the end of Sections 6, 7, and 8 we exemplify the general results by 
considering the case p = 13, and other cases including the example in (1.1). 


2. STATEMENT OF RESULTS. Let N := {1,2,3,...} and P:=={peEN:p isa 
prime number}. 


Theorem 1 [Fermat and Gauss]. Let p € P with p = 1 (mod 4). Then there exist 
two unique, positive, co-prime integers u,v © N such that 


p=u* +0’. 
Proof: See Sections 6 and 7. ins 


Theorem 2 [The Euler Criterion]. Let p € P with p = 1 (mod 4). Then 
1. The quadratic equation 


x* = —1(mod p) (2.1) 
has two unique solutions x), x, € N such that 
ny Ce 1)72: sand. (pl) 7/2 <4, <p, 
with x; = p — Xo. 
2. All other solutions of (2.1) are congruent to x, or x, (mod p). 
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Proof: See Section 8. i 


For a detailed discussion of the Euler Criterion see the book by Dudley [8, pp. 
85-86]. 


2. HENRY JOHN STEPHEN SMITH. Henry Smith was born on 2 November 
1826 in Dublin, Ireland. His father died soon afterwards and the widow moved 
with her family to England. Smith was educated first by his mother and then by a 
succession of private tutors, before spending three years at Rugby School; from 
this School he gained entry to the University of Oxford, in 1844, by winning the top 
scholarship to Balliol College. In 1848 at Oxford he gained first class honours in 
both classics and mathematics; he also won the major University prizes in both 
these subjects, the Ireland scholarship in classics, and the Senior Mathematical 
Scholarship in mathematics. 

In 1849 the Balliol College fellowships in classics and mathematics fell vacant; 
until this time Smith seems to have been undecided as to whether to follow a 
career in classics or mathematics, but seems to have settled at this time on 
mathematics. His first paper, on geometry, dates from the next year. 


Henry John Stephen Smith (1826-1883). 
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Smith completed his first paper on number theory in 1854 and published it the 
following year in Crelle’s Journal |20]. Unusually, even for that time, it was written 
in Latin, perhaps in homage to Carl Friedrich Gauss, whose Disquisitiones arith- 
meticae served as an inspiration. 

In 1859 Smith was elected to Fellowship of the Royal Society of London and 
then, in 1860, to the Savilian Chair of Geometry in the University of Oxford; two 
of his eventual successors to this Chair were G. H. Hardy and E. C. Titchmarsh. 

Henry Smith died in Oxford in 1883. 

It is puzzling that Henry Smith’s work and name are so little known, even 
amongst those who make regular use of the ideas that he introduced; this point is 
discussed by Keith Hannabuss in his paper The mathematician the world forgot [10]. 
Several historians of mathematics have ranked Smith with Cayley and Sylvester 
among the great pure mathematicians of the nineteenth century. In 1861 Smith 
proved the existence and uniqueness of what is now called the Smith Normal Form 
of a matrix with integer entries. This result has subsequently been used to prove 
the cyclic decomposition for modules, but Smith’s first application was to deter- 
mine when linear Diophantine equations admit solutions, settling a longstanding 
problem first studied by Greek mathematicians. His remarkable contributions to, 
and his panoramic knowledge of, the theory of numbers can be seen in the 
monumental Report on the theory of numbers, reproduced in [21]. In this area, in 
1868, he shared the Steiner Prize of the Royal Academy of Sciences in Berlin for 
his solution of a geometric problem that involved the representation of integers as 
a sum of squares. 

Not so well known is Smith’s early contribution to measure theory and integra- 
tion in his paper of 1875 On the integration of discontinuous functions; see [21, paper 
25]. There, Smith introduced the first example of what is now called a Cantor set; 
Cantor’s own example appeared eight years later and was not presented as his own 
discovery. Smith’s example divides an interval into m > 2 subintervals, and then 
keeps repeating this process to each remaining subinterval, except the last. Smith 
also seems to have been the first mathematician to perceive the connection 
between measure and integral. However, his paper received less attention than it 
deserved, owing to an inaccurate review in the Fortschritte der Mathematik. In his 
history of integration, see [15, pp. 37, 40], Thomas Hawkins has remarked: 


Probably the development of a measure-theoretic viewpoint within integra- 
tion theory would have been accelerated had the contents of Smith’s paper 
been known to mathematicians whose interest in the theory was less tangen- 
tial than Smith’s. 


For an informed discussion on the contents of this paper of Smith, and for the 
development of the ideas therein to higher dimensions, see [10] and, especially, 
[11]. 


4. CONTINUANTS. Continuants are closely connected with continued fractions, 
as noted by Smith at the beginning of [20]. There is a detailed and elegant account 
of this connection in Chrystal [5, Chapter XXXIV, Sections 4-11]. However Smith 
uses Only continuants in his paper and uses determinants to define them; for this 
definition see [5, Chapter XXXIV, Section 11] and the reference therein to the 
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remarkable history of determinants by Muir and Metzler [18, Chapters III and 
XIII]. We follow Smith and make the 


Definition 1. For n € N let g, € N (ry = 1,2,..., 7); then define [-]: N’ ~ N by 
the determinant 


q 1+ 0 - 0 O 
=f Q {i . es 0 0 
0 -1 q « 0 0 

141.922 939+++9 In-19 Ind =] oe -| (4.1) 
0 OO O gq, 1 
0 0 OQ: ss =1 @ 


We note that 
[aql=a, [4,4] =4192 +1, and [q,,q,93] = 419293 + 9, + G3. (4.2) 


Lemma 1. Letn EN with n > 2. Then 


1. [413 Go5-++s And = Caillge, 935---5 Ind + 143,--+5 An 

2. [915 9ax++ +5 Mal EN 

2; (41, Gys+++5 Qn) = [Gast Gov Qi] 

» £4o5435++ +5 Ind < [G15 9a5-+ +5 Ind 

© 1455 I++ +> In) and 1qy, Qo,---59,] are co-prime integers 


Nn 


[Qisaaes Oia Ges Gis Gages Gel = sehen Gel Gis a oo leg, | 
sel peer aera | seer Pal 


Proof: Note that if in any formula in Lemma 1 an empty continuant appears then 
it is convenient, and consistent, to give such a continuant the value 1. 


1. Expand the determinant (4.1) by the first row. 

. Use (4.2), property 1 and mathematical induction. 

. Standard property of determinants. 

. Use properties 1 and 2. 

. Use property 1. 

. Use the Laplace expansion on (4.1) centred on row s; see also the proof in [5, 
Chapter XXXIV, Section 6]. 


Nm BW WN 


5. THE EUCLIDEAN ALGORITHM 


Algorithm 1. Let r,s € N be co-prime with s < r and write 


r t ; S u , U 0 
7, oe ae <t<s),-=qat+- SUVs = SS Sg, 
a ae ( 5), 5 Qt (O<u<t) go te a 


(5.1) 


for somen € N withn > 2, q, Ee NG = 1,2,..., 7), and q,, = 2. 


Thus a rational number r/s > 1 is associated with a set of positive integers 
(41, 9o5+++>Y,} Satisfying the properties in (5.1). Conversely we have 
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Lemma 2. Let a set of positive integers {q,,q>,...,9,} be given with n >2 and 
Gd, = 2. Then there is a unique rational number r/s > 1 whose Euclidean algorithm 
yields the set {q,,qo;---+,4,}; moreover, r/s is determined by 


F [415 925-05 An] 


7 = 5 52 
s [925 930--+5 I ( 

Here r and s are co-prime and given by 
r= (41) a5+++s In] and § = [G55 9ay<29q; |. (5.3) 


Proof: Define r/s by (5.2) and apply property 1 of Lemma 1 n times. The result 
(5.3) follows from property 5 of Lemma 1. fs 


6. THE SMITH PROOF OF THE FERMAT THEOREM 


Proof: Let p € P with p = 1 (mod 4) and write p = 4r + 1. Let the number pu be 
taken arbitrarily from the set of positive integers {1,2,...,2r} and consider the 
corresponding set of rational numbers {p/w}, noting that 2 < p/p <p. If we 
apply Algorithm 1 to p/p we obtain a representation of the form 


PL Gay ed (6.1) 
LM [das-+-9 An 
Of course, the integer n and the set {q,,q>,...,q,} depend upon the particular 
choice of w. From property 5 of Lemma 1 we obtain 
P = 141: 920-++5 In] and = [do5---5 Qn l- (6.2) 


From Algorithm 1 and from property 1 of Lemma 1, since p/p > 2, it follows that, 
in the representation (6.2), 


q,=22 and gq, =2. (6.3) 
Now take one of the rational numbers p/p with 
fi S 42s Spates 2h 


then we have the following chain of argument, using property 3 of Lemma 1 and 
(6.1), 


B= Mee), ry. bad os 
fay Peper 129 YU29°*+9 Yn n? aa 1 


[Guerin dane Og ae (6.4) 


LGn-19++-9 M1] v 
(say). It follows from (6.3), Lemma 2, and property 1 of Lemma 1 that 1 < pv < p/2, 


so v € {2,3,...,2r}. Thus the chain of argument that gave (6.4) can be reversed, 
starting with v and finishing with wp. 
This argument pairs off the elements of the set {2,3,...,2r} and gives each 


member yp of the set a unique mate v in the set. However this set contains an odd 
number of elements so there must exist at least one member, say A, that mates 
with itself in the chain (6.4). For this A we obtain from (6.4) 


Ldir arses Ind PL n-ty G (65) 


Gisecey Gi A Gi iseeedy | 
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Now apply Algorithm 1 to both sides of (6.5) to give a representation 


P= [411929--09 Mn, (6.6) 
with the palindromic property, and with (6.3) holding, 
= Gre (@=12 00). (6.7) 


If, in (6.7), n = 2t + 1 is odd then n > 3 and the representation (6.6) takes the 
form, for s > 2, 


P= [1p 065 Os-19 Is Us-9 oe HI: 
Now apply property 6 of Lemma 1 to give 
P= [Gyre 0 Qs 19 Is IGs—a9 009 GI) + [die Ie WU Gs-2 9 QT. 
Other properties of Lemma 1 permit us to write 


P=: [GisseoeGeca ll Gist el PF LG eal 


which represents the prime number p as the product of two factors that, using 
(6.3), are both greater than 1; this is a contradiction to p € P. 

Thus in (6.7) the integer n = 2t¢ must be even and so (6.6) takes the form, for 
SS 1. 


P= [cea her: pee | 
with g, => 2 from (6.3). Now apply property 6 of Lemma 1 to give 
P= Wiper 9 ANG isacesGi\ died Gee Miao ese Gi) 
and then 
2 2 
P= ised FP 1 eiseeesGsai | 


From property 1 it follows that [q,,...,q,_,] and [q,,...,q,] are co-prime. 
This completes Smith’s proof of the Fermat part of Theorem 1. a 


Consider the case p = 13. Then pw € {2,3, 4,5, 6} and the application of the 
Euclidean algorithm to each choice of p gives 


m=2 n=2 q=6 G=2 
pars n=2 Q=4 gs 
w=4 n=2 Q=3 GQ= 
aS WS Qyaee. Wy liga). 394 = 
M=6 n=2 gq, =2 q=6. 


Thus in the Smith pairing, 2 pairs with 6,3 pairs with 4, and 5 pairs with itself. The 
palindromic continuant given by 5 then yields the two squares result 


13 = [2,1,1,2] = [2,1][1,2] + [2][2] = [2.1] + [2] = 3? + 2°. 


7. A COROLLARY. We have 


Corollary 1. Let p © P with p = 4r + 1. Then there are exactly 2r distinct continu- 
ant representations of p 


| Liam ry | 
with q,, = 2. 
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Proof: Let w € {1,2,...,2r}; then from (6.1) 


Pp [aisQose 203g, | 
Eee Hal gat Fa, bs 7. 
be [do5--++ Fn] aie ( ) 


with q, = 2; these continuant representations of p are distinct since otherwise, 


from (7.1), p/w = p/p for w # w. 7 

Let p have a representation p = [q,,q),...,q,] with g, => 2. If n = 1 then 
4d, = 4, = p and we can take w = 1 in (7.1). If m = 2 then, since q, > 2, it follows 
from properties 1 and 3 of Lemma 1 that [q,,...,q,] < (p — 1)/2 so that in (7.1) 
we have [q,,..., 9,1] € {2,..., 27}. a 


For p = 13 we obtain six continuant representations 
13 = [13] = [6,2] = [4,3] = [3,4] = [2,6] 
and 


13 = [2,1,1,2] = 


pe Re - © 
NRF Cc OS 


2 1 
= 1 
O -1 
0 i = 


8. PROOF OF THEOREM 2. We begin with 


Lemma 3. Given anyn € N withn = 2 and any set of positive integers {q,, q,---5 ns 


define 
TGs Ga5+++5 On) = [41s a+ ++ Ind G23 9350+ +> In—-1] 

—[41,9250009 In-1 425+ Mn I- (8.1) 

Then 
LC 91> 920-089 4n) = (a1) (8.2) 

Proof: We have, from (4.2), 
1,(41592) = [41,421 — lai Jla2] = 1. (8.3) 
For the general case we have, from property 6 of Lemma 1, 

PGipees Gy) = bGiskase Gyat 1d eG mexasG yz (8.4) 
Greater Gal: = LO rected Ia Masa woen Gash: (8.5) 


Multiply (8.4) by [q,,...,¢,-,] and (8.5) by [q,,...,q,_,] to give, using (8.1), 
TGs Ga9+++9 In) = [415-02 In—2 W429 +3 In-1) — [925-262 Ina Wis Mn-1 
= — 1,113 das++ +s In-1)- 
Repeated application of this last result yields 
Gye) AN) Tea Gecesgie), Are (2am 2), 


taking r =n — 2 and using (8.3) gives I,(q1, q,---.9,) =(-1)” *L, = (- 1)", so 
(8.2) follows, as required. a 


Proof of Theorem 2, Part I. Let p € P with p = 1 (mod 4). Then the proof of 
Theorem 1 ensures that p has at least one palindromic continuant representation 


P= [419-669 M50 Igo M11 (8.6) 
with s > 1 and q, = 2. 
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Define Xy e N by 


Xq = [Go5--+5 952 Igo--- M13 (8.7) 
it follows that, from qg, = 2 and property 1 of Lemma 1, 
<x) = Cp =—1) 72: (8.8) 


Now apply the result of Lemma 3 to the right-hand side of (8.6), with n = 2s, to 
obtain 


Osi eos Gs 280 I Gps Ay Gisseas gs | 


AO psenite ep paces On Organs Ges jaan yl — (-1)" = |. 
From (8.6), (8.7), and property 3 of Lemma 1 the preceding result reduces to 


PLGa5+++9 959 9s9-++9 2] — X90 = 1 (8.9) 
and so 
xg = —1(mod p). (8.10) 
This result, together with (8.8) completes the proof of Part 1. Psi 
Remarks 


1. We note that (8.9) was known to Henry Smith and is stated at the end of [20]. 
2. We note also, from the proof of this part of Theorem 2, that the number 


d= Xp (8.11) 


is a member of the set {2,3,..., 7} and, for this choice of A, the quotient p/A 
yields the palindromic continuant representation of p; see (6.5). 


Examples 
1. For the case when the prime p = 13 we have the following explicit results 
P= [4159250063450 4sr0++9 92H] = [2,1,1,2] 
Xq = 1423-++5 959 Qo 92s M11 = [1,1,2] =5 
G5ict Qos; = yl) = 2. 
The general result 1 < x) < (p — 1)/2 becomes 1 < 5 < 6 in this case. We can 
now confirm the results (8.9) and (8.10): 
PL 905-655 U59 IWe9++29 Go] — XG = 132-57 =1 
and 
xg = 25 = 26 — 1 = —1 (mod 13). 
2. Let p = 1913; then p = 1 (mod 4) and, see (6.5) and (8.11), A =x) = 712 


since 7127 = 506944 = 265 x 1913 — 1 = —1 (mod 1913), with 1 < 712 < 956 = 
(1913 — 1)/2. The Euclidean algorithm yields 


p 1913 [2,1,2,5,5,2,1,2] 


DD ESS -9 19) 
and since [2, 1, 2,5] = 43, [2, 1,2] = 8 we have 43% + 8* = 1849 + 64 = 1913. 

3. For p = 969433, with p = 1 (mod 4) the appropriate palindromic continuant 
is p = [2,3,4,5,6,6,5,4,3,2] with [2,3,4,5,6] = 972 and [2,3,4,5] = 157; thus 
969433 = 972* + 1577. 
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4. For the example (1.1) produced by the Brillhart method, ie. p = 10°° + 577, 
it may be shown that 


A = X9 = 24574739597286316058804545812463447369459349571921 
and the relevant palindromic continuant is 
p= [4,14, 2,4, 4,1, 5, 1, 3,4, 2,17, 1,1, 1,3, 1, 2,1, 7, 1,4, 1, 1,2, 7,11, 1, 
LO; dp Spl lae OL ed 2 hs Oyen loys Oy dl, 
Ly 135551213 td 22 e219 14. 3, 3551 
LE eal le 2k glade beh Ae ool oA A | 
which has length 112. Thus we have, for the two squares representation, 
USA NAD AA S13 4nd obs ly be Oy Ode dod, Dede 2, Fol de 
15359535 15.14, 149; 12511, 1, 1522.4. 11,023,201, 1.1-3.3..1,5,.1] 
= 7611065343808354245450401 
and 
pe 4.14: 2 AAS 13.44 2s lis le ody hs Ole Ty bet, 2,7 Lied 
13,3335 15 142109 G21 1d) 22.4, 1 ls 32 1353135) 
= 6486268906873921642245424 
with p = u? + v’, as in (1.1). 


Proof of Theorem 2, Part 2. Suppose that r is another solution of the quadratic 
equation (2.1) with r#x, and r#p—x,; without loss of generality we may 
suppose that r is a least, positive residue (mod p). Then r* = xj = —1 (mod p) 
and hence p divides r* — xj = (r —x)Mr + x,); since p is prime it divides r — x, 
or r + xX). The former case implies r =x, (mod p), but since both r and x, are 


least, positive residues it follows that r = x). In the latter case r= —x) =p — Xo 
(mod p) and since r and p—x, are least, positive residues it follows that 
r = p — X,. This contradiction completes the proof of Part 2. a 


9. THE “SMITH” PROOF OF THE GAUSS THEOREM. We are now in a 
position to give a proof, using the methods of Henry Smith, of the Gauss 
uniqueness result for the Fermat theorem, as presented in Theorem 1. 


Proof: Let p € P with p = 1 (mod 4) and suppose that there are two, co-prime 
two squares representations: p = u? + v* and p =s* +r’, with u <u, 5 <r. 
Apply Algorithm 1 to the rational numbers v/u and r/s to obtain 


v A er e Nii Pe cents 
pe ON ne oN | 


u LO vena Gy s [bois a etl pc 


y) 


then 
U=([G5-+-5dn] and v= [415405-++9 Gn] 
s=[t,,...,t,]| and r=[t,,t,,...,t,]. 
Hence, property 6 of Lemma 1 ensures that 
paw tv? =[qo...0sdnl + [41s dare Mal 
= [ns 95925919 Ms Ga In] (9.1) 
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and 


2 2 
Dis Sr — [Poeedasd fs Dankel | 


>”°m 
So baal pelns biplanes ts |e (9.2) 


Part 1 of Theorem 2 guarantees that the continuants [g,_1,---5 15 91>++ +> Un—12 An] 
and [t,,-15-++sliti+-:stm—1tm) are both solutions of the quadratic equation 
ja 


x“ = —1 (mod p) and satisfy 1 <x < (p — 1)/2. From the uniqueness of this 
solution we have 


Le aig as Die Qi=is Ga F.. iy Agate lieesss pois al — p (say). 
(9.3) 


From (9.1), (9.2), and (9.3) it follows that 


ees sci GGos Gis Gin Goce Gy | = eset vin cbse a) 


p Een: re een eee an Ee re ree oe ere 
(9.4) 


Applying Algorithm 1 to both the continuant terms in (9.4) shows that m =n and 
g,=t; G@ = 1,2,++:,n); thus u = s,v =¢t and the uniqueness result is established. 
a 


10. APPENDIX. In this appendix we reproduce the original 1855 paper [20] of 
Henry Smith. 


DE COMPOSITIONE NUMERORUM PRIMORUM 
FORMAE 4A + 1 EX DUOBUS QUADRATIS 


Sit 
1 
q+ 1 
+ 
q2 Gig 
1 
+ —- 
An 
fractio continua, cujus numerator, qui determinanti 
”q 1 0 0 v 
0 -1 4q; 1 0 
0 ee G4 0 
0 0 0 OQ) 3 # el ug: 


aequalis est, per hujusmodi formulam [q, q, q;.-..4,-19,] exprimatur. Erit ergo 


[9192 ©» G19) = (4; 9-1--- 2%] 
et 


[41 +--+ @n) = [41 42 --+ G1 [Gini --- nd + 91 a2 + G11 Gise «++ Mads 
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quae aequationes pendent ab illa forma determinantali, ambae autem L. Eulero 
debentur. 

Itaque, si quantitatum g par sumatur numerus, ipsaeque ita serie symmetrica 
disponantur, ut binae inter se aequales fiant, elucet, quantitatem 
[91 90 +++ 4; 9+++ 9 G,] summam fore duorum quadratorum inter se primorum; fit 
enim 


2 2 
[4192-9 OU) =a m-..G) + lag... Gril. 


Contra in numero quotientium impari, erit 


[y+ G1 G1 N11 = [a---G-i]+{[In--- 4] F [a,---%-2]}, 


unde colligis, numerum [q,...g;...q,] primum esse non posse, nec duplicem 
numeri primi; si quidem casus excipis, in quibus, aut i unitati aequatur, aut 1 
binario, g, unitati. 
Sit p numerus integer datus; w,, W,... 4, Series numerorum, qui ad p primi 
sunt, ipsiusque p dimidio minores. 
P 


Formentur fractiones continuae ~, ~,..., 2; quae omnes ita terminentur, ut 
My B2 Ms 


is quotiens qui in extremo loco ponatur, unitatem superet. Hinc patet, quanta 
fuerit numerorum p,, M>5,... u, Multitudo, tantum fore numerum determinantium 
[g,-.--4,], qui dato numero p aequales erunt, neque praeter illos ullum dare 
ejusdem formae determinantem, cujus et primus et extremus quotiens unitate 
major sit, quique numero p aequalis esse possit. 

Jam vero, quum duo determinantes [q,...q,] et lg,...q,] aequales sint, 


quumque ipsum gq, unitate majus sit, apparet [q, ...q,] ex una aliqua fractionum 


” oriri. Unde sequitur, data quavis fractione “, inveniri posse aliam in eadem 
be be 


serie, quae quotientes eosdem, ordine inverso, repraesentet. 

Sit p primus, formae 4A + 1; ut numerus determinantium ipsi p aequalium par 
existat. Quum ipse p unus e determinantium serie fiat, unus certo alius inveniri 
poterit in quo quotientium ordo invertendo non mutatur. Cum sit ergo 


P= 141926 DH 
erit denique 
2 2 
P=([4%---G) + laid2--- G1). 


Quam theorematis Fermatiani demonstrationem maxime elementarem esse 
patet, quum pendeat a conversione fractionum vulgarium in fractiones continuas. 

Singulos autem formae 1 + x? divisores ex duobus quadratis conflari, eodem 
modo demonstrare in promptu est. Sit enim 


pv=14 x’, 
apparet fore 
M= (0199-2 
v= [42 93--- 99; --- 93 9] 


x= [4192---U 4+ I. 
Oxford, Maio 1854. 
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NOTES 


Edited by Jimmie D. Lawson and William Adkins 


Number Theory and Semigroups 
of Intermediate Growth 


Melvyn B. Nathanson 


The semigroup S is finitely generated if it contains a finite subset A such that 
every element of S can be written as a product of not necessarily distinct elements 
of A. The length of an element s € S with respect to A, denoted 1,(s), is the 
smallest integer m such that s can be written as the product of m elements of A. 
‘The length of the identity element is 0. The growth function y,(n) of S with respect 
to A counts the number of elements s € S of length at most n, that is, y,(1) = 
card{s € $:1,(s) < n}. 

The semigroup S has polynomial growth of degree k with respect to A if 
y,(n) <c n* for some positive constant c, and all sufficiently large n. For 
example, if S is the free abelian semigroup of rank k generated by a set A of 


Gag 


“,  } and 


cardinality k, then the number of elements of length exactly m is 


n _ nk 

vin) = (MERE NY = (MER) = + (nt), 
m=0 : 

so S has polynomial growth of degree k. 

The semigroup S$ has exponential growth with respect to A if there exists a real 
number 6 > 1 such that y,(m) > 6” for all sufficiently large n. For example, if S is 
the free semigroup of rank k > 2 generated by a set A of cardinality k, then the 
number of elements of length exactly m is k”, and 


n+1 


n 
WOODS Sra, 
m=0 (em 
so S has exponential growth. 
The semigroup S has intermediate growth with respect to A if, for every positive 
integer k and for every real number @ > 1, we have 


n* < y(n) < 6" (1) 


for all sufficiently large n. The purpose of this note is to give a simple and 
self-contained proof of the existence of semigroups of intermediate growth. 

The rate of growth of a semigroup S is an intrinsic property of the semigroup, 
that is, the growth is independent of the choice of generating set. This is easy to 
prove, since if A and B are two generating sets for S, then each element of B can 
be written as a finite product of elements of A. Thus, there exists a constant c > 0 
such that 1 ,(b) < c for all b € B. If s € S is a product of m elements of B, then s 
can be written as a product of at most cm elements of A, and so 1,(s) < clg(s) for 
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all s © S. Therefore, if /,(s) < n, then 1,(s) < cn, and y,(n) < y,(cn). Similarly, 
there exists a constant c’ > 0 such that y,(n) < y,(c'n). It follows that the 
functions y,(m) and y,(n) have the same growth rates (polynomial, exponential, or 
intermediate). 

It is not obvious that semigroups of intermediate growth exist. In the analogous 
case of a group G generated by a finite set A, the length of an element g € G is 
the smallest number m such thatg can be represented in the form 


g=at!.. qt! 
where d,,..., a,, © A. In a famous problem in this MONTHLY in 1968, Milnor [5] 


asked if there exist groups of intermediate growth. The question for groups was 
answered affirmatively by Grigorchuk [1, 2, 4]. Semigroups of intermediate growth 
have also been constructed, but the proofs that their growth functions satisfy (1) 
are complicated. The semigroup constructed in Theorem 1 is known, but the proof 
by means of elementary number theory is new. We prove the intermediate growth 
property using only Chebyshev’s Theorem [6, Theorem 6.3] that a(x), the number 
of primes up to x, satisfies the inequalities 


ee a (2) 


for all x > 2, and the following simple upper bound for the Hardy~Ramanujan 
partition function p(n). 


log x logx 


Lemma 1. For every integer n > 2, p(n) < 2n?v", 


Proof: The function p(n) counts the number of partitions of a positive integer n. 
A partition of 1 is a sequence of positive integers u,,...,u, such that n =u, + u, 
+++ +u, and u, >u,> *- >u,. Associated to this partition is an array of 
points, called the Ferrers graph, consisting of u, points on the first row, u, points 
on the second row,...,u, points on the s—th row. For example, corresponding to 
the partition 19 = 7+5+4+2 + 1is the graph in Figure 1. 


e e e e 
e e 
e 
e e 
e 
Figure 1 


Consider the largest square array of dots that can be found in the upper left corner 
of the graph. In the example in Figure 1, this square consists of three lines, each 
with three points. The length of this square is some positive integer r < yn, and 
the square contains r? points. The remaining n — r* points in the graph must lie 
on the first r lines, to the right of the square, or on the first r columns, underneath 
the square. The number of ways to add these points to the graph is at most n*”. 
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Therefore, the number of partitions of n satisfies 
[yr] 7 
p(n) < din? < 2nv, 


r=] 


Theorem 1. Let S be the semigroup of 2 X 2 matrices generated by the set_A = {a, b}, 
where 
fi A eee 6 
a-({ 4 and b= (1 ie 


2 2c1yn /logn < y4(1) < n’p(n) < I n2vn +2 (3) 


for some constant c, > 0 and all sufficiently large n. In particular, S has intermediate 
growth. 


Then 


Proof: We observe that the matrices a and b satisfy the identities 


b* =b, 
x _{1l ek 
° t i) 
and 
mk —_{kK+1 Oj) _ 
ba“b ee 2 (k+1)b 
for every positive integer k. Therefore, for any positive integers k,,..., k, we 
have 


ba*'ba*?b +++ ba*tb = [bay = [TT + 1)]b. (4) 


Let p1, P>,---, p, be distinct prime numbers not exceeding yn . Then 
ba?!~'ba?2-'b +++ ba? 'b = [T1p,}.. 
By Chebyshev’s Theorem (2), the length of this seed does not exceed 
1+ Das +isiia(n)+1s 2 +10 


for all sufficiently large n. Therefore, each of these elements is counted by the 
growth function y,(n). These elements are distinct, since every positive integer is 
uniquely a product of primes, and so every subset of the primes up to Vn produces 
a different element of the semigroup S of length at most n. This gives the lower 
bound 


¥4(7) > Q7(yn) > J2¢1V0 /logn- 


Next we compute an upper bound. Let s € S have length 1,(s) < n. There are 
three possibilities. First, there are n+ 1 elements of the form s =a" with 


0 <u <n. Second, there are ("+ ! 


and 0 <u +u<n-—1. Third, we can have s = a“s'a’, where s’ = ba*'ba*2b --- 
ba**b, u,v are nonnegative integers with u + v <n — 3, and 1,k,,...,k, are 
positive integers such that k, + ++ +k,+r+1<n. Equation (4) implies that 
s’ = bakemba*emb ++» bakeb for every permutation o of 1,...,7, so we can 


] elements of the form s = a“ba’ with u,v > 0 
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assume that k, > + >k,>1. Let k; =k, + 1. We associate the following parti- 
tion of n to the semigroup element s’: n =k) +--+ +k, +1+--- +1, where 
ki, = ++ =k, > 2 and the number of 1’s in the partition is 


n—- Yi ki>1. 
i=1. 


This is a one-to-one mapping of the elements s” to partitions of n. It follows that 


there are at most p(n) semigroup elements s’ of the form (4) with k, + ++: +k, + 
-1 


; ] choices of nonnegative integers 


r+ 1 <n. Since u + v <n —3, there are (” 


u,v, and so there are at most (" ; '|p(n) semigroup elements of the third type. By 
Lemma 1, for n > 2 we have 


y(n) <nt+1t ’ 5; : + [” i "| p(n) <n’p(n) < 2nrvn +2, 


This gives the upper bound. The semigroup S has intermediate growth since the 
left side of inequality (3) grows faster than any polynomial, while the right side 
grows slower than any exponential function. a 


Let A be a finite subset of a group, with 1 <A. Let S be the semigroup 
generated by A, and let G be the group generated by A. Denote by A” the set of 
all products of n elements of A. Denote by A*” the set of all elements of the 
form at! --- a*'. The growth function of the semigroup S with respect to A is 
Y4,s(n) = |A”|, and the growth function of the group G with respect to A is 
Yo) = |A*"|, Clearly, y, 5(2) < yg) for all n. It is natural to ask if these 
two functions must have similar growth rates. Grigorchuk [3] proved that if y, 5(7) 
has polynomial growth of degree k, then y, (1) also has polynomial growth of 
degree k. It is not known if it is possible for the semigroup function y, ,(1) to 
have intermediate growth while the group function y, (1) has exponential growth. 
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The Gottschalk-Hedlund Theorem 


Randall McCutcheon 


In 1955 Gottschalk and Hedlund proved that if X is a compact metric space and 
T:X — X is a minimal homeomorphism (in other words, if the only closed sets 
Y CX for which TY C Y are Y= X and Y = ©), and if f: X — R is continuous, 
then f = g — Tg for a continuous function g (where 7Tg(x) = g(7x)) if and only if 
there exists K < © such that |~)_,7T”f(x)| < K for all NG Nand x eX [3]. 
Necessity is obviously violated if one allows X to be non-compact. However, the 
following theorem, which for Hausdorff spaces is due to Browder [1], is also true: 


Theorem A. Let CX, T) be minimal, where X is any topological space and T : X > X 
is continuous. Let f : X — R be continuous and suppose that for some K < © we have 
[o_o T"f(x)| < K forall N € N and all x © X. Then for some continuous g we have 


f=eg- Tz. 


In 1993 the author, at the request of P. Schwartz, produced a proof of the 
Gottschalk-Hedlund theorem. Jt amounted to a minor alteration of a proof found 
by the author in 1989 of the fact (due to Bohr) that the integral of an almost 
periodic function, if it is bounded, is itself almost periodic; see [2, Theorem 5.2]. 
The proof, which sufficed for Theorem A as well, was so innocuous as to seem 
hardly interesting. 

Schwartz, however, noticed something novel in the proof that “made the 
existence of a more general cocycle theorem seem likely” [6]. Indeed, he was able 
to adapt the proof to obtain a generalization of the Gottschalk-Hedlund theorem 
in a setting involving convolution operators. M. Lin and V. Bergelson then 
suggested that the proof would go in the context of Markov operators. Schwartz 
obtained something along these lines in [7]. 

Finally Lin and I. Kornfeld in [5] obtained a more general result of this type. 
Let X be compact space. A Markov operator on CCX) is a positive contraction T 
with 71 = 1. 


Theorem B. Let X be a compact Hausdorff space, and let T be an irreducible Markov 
operator on C(X) (see [5] for the definition of irreducible). If g = C(X) satisfies 
supylIZjLo7Z! gil < ~, then (and only then) there exists f © C(X) with g = f — Tf. 


The progressively more general results obtained in [6], [7], and [5] suggest that 
perhaps the proof we present here is somewhat interesting, after all (the central 
idea is mimicked in all three of these cases). Some form of the proof could, of 
course, be distilled from any of the aforementioned papers (and, in fact, it appears 
explicitly in [6]), but it wouldn’t be completely clear how to do so most simply, as 
there are a few complications in the more general situations that require modifica- 
tion of the original argument. 

Since no extra effort is involved, we actually prove a version of Theorem A that 
allows for continuous time, in which case the functional equation takes the form 


— “g(T,x) = f(T,x); see [4, Lemma 2.7]. Note that the proof remains valid for 
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maps and semiflows as well as homeomorphisms and flows. Finally, we derive as a 
corollary (Theorem D) the result that led to the proof’s discovery: the integral of 
any almost periodic function over R, if bounded, is almost periodic. 


Theorem C. Let (X,{T,}) be a minimal flow (with either discrete or continuous 
time). Let f: X > R be continuous. If for all x © X there exists K < © such that 
| i Pee f(x) dt| < K for all L = 0, then there exists a continuous function g such that 


— 9(T,x) = f(T,x) forallx © X. 


Remark. In the case of discrete time, we take [ . to mean Ly. 


Proof: For x € X, , put g(x) = supy so {o'T, f(x) dt. We claim that for every L > 0, 
o(x) = supy s, fT, f(a) dt. 

Suppose the claim is false. Then there exists x © X and L > 0 such that 
g(x) - suPy sr Sq T, f(x) dt = « > 0. Pick M <L such that g(x) = i T, f(x) dt. 
Then [. T, f(x) dt <-—e for all N>L. Let 5=infs.,f"T,f(x)dt and fix 
SS wih i T, f(x) dt < 6+. 

By minimality of the flow, {7,y:t > 0} is dense in X for every y © X since its 
closure is a non-empty invariant set. In particular (taking y = T, _,,x), we may 
choose r > L — M such that 7.x lies in a neighborhood of x suitably chosen so as 
to ensure that 


[OTA dt = [ TALx) dt<é8t+e. 


We have used continuity of the map x > [{ id, f(x) dt; this is a fairly routine 
exercise. Hence 


[TI dt = ec. dt + [Tse dt< -e+St+e=6, 
M r+M 


a contradiction that proves the claim. 
We now have, for all L > 0, 


g(T,x) = sup fT,f(T,x) dt = sup [> T,f(x) dt = g(x) — fT f(x) dt. 
N>0 “0 N>0°L 0 
(1) 


By the fundamental theorem of calculus, — “9(T,x) = f(T,x) (the discrete case 


follows directly from (1) by letting L = 1). All that remains, therefore, is to show 
that g is continuous. 
By inspection g is lower semicontinuous. Let h(x) = infy, 9 / T, f(x) dt. Then 


A(T, x) = h(x) ~ fT F(x) dt. (2) 


To see this, just replace f by —f in the argument above. Clearly h is upper 
semi-continuous. Combining (1) and (2), we obtain (g — AMT, x) = (g — h(x) for 
all L > O and all x €_X. Let x, y © X and e€ > 0 be arbitrary. Notice that (g — h) 
is lower semicontinuous. Utilizing the denseness of orbits, we obtain L > 0 such 
that 7,x is “close enough” to y to ensure that (g —h\(x) =(g —h\(T, x) = 
(g — h)(y) — e. However, since ¢ is arbitrary, (g — h)(x) > (g — h)(y). Reversing 
the roles of x and y, we see that actually (g — h)(x) = (g — h)(y), which implies 


Aug.-Sept., 1999] NOTES 671 


that (g — h) is constant on X. Upper semicontinuity of g now follows from upper 
semicontinuity of h, since g differs from h by a constant. | 


Recall that a continuous function f:[0,«) — R is almost periodic if for every 
€ > 0 the set {n: || f(x) — f(x + n)ll,, < €} is syndetic, i.e. has bounded gaps. For 
L = 0, let T, f(x) = f(x + L). It may easily be*shown that if f is almost periodic 
then X = { T, f: L = 0} is compact in the uniform norm. Moreover {7;}, . 9 acts 
as a minimal isometric flow on X (with respect to the uniform norm) and f(x) may 
be recovered by looking at the values at 0 of the members in the orbit of f: 
f(x) = T, f(O) = g(T. f), where g is the (continuous) function that assigns to a 
member of X its value at 0. 

More generally, if CX, {7,}) is a minimal isometric flow on a compact space, 
y €X and g: X — R is continuous, the function f(t) = g(7T,y) may be shown to 
be almost periodic. Hence the almost periodic functions are exactly those that 
arise (in this manner) from isometric flows on compact spaces. The following 
immediate corollary to Theorem C makes use of this fact. 


Theorem D. Let F :[0,) > R be almost periodic. If there exists K < © such that 
|{> FW) dt| < K for all L > 0, then H(s) = [> F(t) dt is almost periodic. 


Proof: We have an isometric flow on a compact space CX, {7,}), a continuous 
function f on X, and a point x © X such that F(t) = f(T,x). The boundedness 
condition in the theorem is now exactly as in Theorem C. Hence there exists a 
continuous g on X such that H(s) = [>T, f(x) dt = g(x) — g(T,x), which implies 
in particular that A is almost periodic. 7 
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A Mean Value Theorem 


Tadashi F. Tokieda 


Several theorems go by this name. The present note adds to the assortment an 
unusual variant (Theorem 1), which involves the shape of the underlying region in 
an interesting way. 

We work in Euclidean spaces, although Lemma 2 and the second inequality of 
Lemma 3 carry over to general Riemannian manifolds. V and | | denote gradient 
and norm with respect to the standard inner product <, >, and od stands for 
boundary. All our functions are real-valued. A gradient curve of a function f is an 
integral curve of Vf. 


Theorem 1. Let f be a C'-function on a closed ball B. Then there exists b € B at 
which |Vf(b)| -diam(B) = max f — minf. 


The proof is obtained via Lemmas 2 and 3. 


Lemma 2. Let f be a C'-function without critical points on a compact region B. Then 
every gradient curve of f begins and ends on OB. 


Proof: Say a gradient curve y(s) is defined for s from s_ to s,. We have 


Jim f(y(s)) — lim f(y(s)) = [ (Vf, dy) 


= fivs ldy| because y is tangent to Vf (*) 
Y 


> min|Vf| - length(y). 


On compact B, f is bounded, so if f has no critical points (min|Vf| > 0), (*) 
shows that length(y) is finite and y(s_,) exist. Unless both y(s_) and y(s_,) lie on 
0B, y can be extended beyond s_ or s, by the existence theorem for solutions of 
differential equations, contradicting the choice of s ,. a 


Remark. In Lemma 2, compactness is indispensable: think of the height function 
on an infinite vertical cylinder. 


Lemma 3. Let f be a C'-function on a closed ball B. Then 
max f — minf 


in| V 
min|Vf| < iat) 


< max|Vf|. 


Proof: First inequality: If f has critical points on B, then min|Vf| = 0. Otherwise 
consider the gradient curve y through the center of B. y reaches 0B by Lemma 2, 
so that length(y) > diam(B); combine this with (*) to get 


max f — minf > min|Vf| - diam(B). 
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Second inequality: Let / be a line segment that joins a minimum and a maximum 
of f. Since length(/) < diam(B), 


max f — minf = [<P dl) < [ivr |di| < max|Vf| - diam(B). ei 
I I 


Remark. In Lemma 3, the first inequality is true only on a ball: f(x,y) =x isa 
counterexample on [0,1] x [0,1]. The second inequality holds on any convex 
region. Both become equalities for affine functions on balls. 


Theorem 1 is now immediate: 


Proof of Theorem 1 Apply Lemma 3 and the intermediate value theorem to [Vf]. 
LI 


I do not know how close the mean value property of Theorem 1 comes to 
characterizing balls. However, Theorem 1 does admit a partial converse. To state 
it, we need a definition. 

The width w,(e) of a compact region B in the direction of a unit vector e is 
defined as follows. ‘Sandwich’ B by a pair of parallel planes perpendicular to e; 
w,(e) is the distance between these planes: 


w,(e) = max<e,r) — min<e,r). 
reB reB 


B has constant width if w,(e) has the same value for all directions e. A ball has 
constant width, but there are shapes of constant width that are not balls (e.g., 
Reuleaux’s tetrahedron). 


Aside. Why are lids on manholes round? Answer: because a lid whose rim is not a 
curve of constant width can fall into the hole if (un)suitably rotated. Of course, the 
lid and the hole need not be circular; any shape of constant width would be safe. 


Return to the partial converse to Theorem 1. 


Theorem 4. Let B be a compact region such that for every linear function f on it, there 
exists b € B at which |Vf(b)| - diam(B) = max f — minf. Then B has constant width. 


Proof: Suppose B has maximal width in the direction of e,, minimal width in the 
direction of e_, and w,(e_) < w,(e,). Then the linear function f(r) = (e_,r> 
violates the assumed property of f, as |Vf| = 1, diam(B) = w,(e,), max f — minf 
= w,(e_). is] 
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On an Example of Jacobson 


B. Sury 


In Vol. II of Nathan Jacobson’s celebrated book Ph, there appears the following 
exercise on p. 49: 


Let FE, be the field with p elements, and P = F(x, y) where x, y are indeterminates. 
Let E be the subfield F(x? —x, y? — x). Show that [P: E] = p’, that P/E is not 
separable, and that P/E contains no purely inseparable element. 


Now, it is seen immediately that Jacobson’s example is really a nonexample. 
Surprisingly, none of the other standard graduate texts seem to give an example, 
although one can be found in [1, Ex. 17, Ch. V]. Here is another: 


Example. Let P = F,(x, y) and let E be the subfield F(x? — x, y?x). Then 


(Gi) [P: E]= 
(ii) P/E is not separable, and 
(iii) P/E contains no purely inseparable element over E except those contained 
in E. 


Recall that an element x in an algebraic closure K of a field K is separable if 
its minimal polynomial f(T) in K[T'] has all roots (in K) simple. 

It is said to be purely inseparable over K if it is fixed by all K-automorphisms 
of K. More generally, an algebraic extension L of K is said to be purely 
inseparable if the only elements of L that are separable over K are the elements 
of K itself. Any algebraic extension L of K is built in two stages: K CL,,, CL, 
where L,,, is separable over K, and L is purely inseparable over L,,, 


sep 
One has as a consequence of this definition: 


Let x € K, and let f(T) be its minimal polynomial over K. Then, the following 
statements are equivalent: 


(i) x is not separable over K; 
(ii) The derivative f'(T ) is the zero polynomial; and 
(iii) K is of characteristic p > 0, and f(T) € K[T?]. 


Under any of these equivalent hypotheses, if n is the smallest integer such that 
x?” EK, then the minimal polynomial over K is f(T) = T? — x”. 


We return to our example now. 


=Fi(4,y) DE =E6E,(, ae where a = x? —x, b = yx. Then, over E, y satisfies 
| * polynomial 3(T) = + rT?(P-) — 5, where r= b/a and s = b? /a. Also, 
P= E(y). 
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We note: 


(a) x is separable over E and y is inseparable over E. 

The separability of x follows from the preceding remarks by looking at the 
polynomial 7? — T — (x? — x). This is a polynomial over E satisfied by x. In fact, 
the Artin-Schreier Theorem [3, Ch. 8] shows that this polynomial is irreducible and 
is the minimal polynomial of x over E. However, we do not need this fact for the 
proof. 

The inseparability of y is a consequence of the observation that T? — y? is the 
minimal polynomial of y over the field E(x). 


(b) y is not purely inseparable over jae 
As y? €€ E, one has also y? ¢ € E; otherwise, from g(y) = 0 one concludes 
y?-) & E, which would lead to the erroneous conclusion y? € E. 


(c) x is not a p-th power in P. 
This is easy to check by a simple comparison of like powers of x in view of the 
algebraic independence of x and y over F,,. 


Suppose t € P\E is purely inseparable over E. Then t? € E (because if 
t? © E for some n > 2, then since the degree of P = E(y) over E is at most p’, 
n < 2. But, if n # 1, then P would be purely inseparable, a contradiction). 

Look at P D> E(t) > E. Now, the minimal polynomial of t over E is T? — ??, 
and [E(t): E] = p. Note that a? € E for all a € E(t). 

Let [P: E(t)] =1, say. If the minimal polynomial of y over E(t) is f(T) = 
Yj_ a,T', then y satisfies the polynomial f(T)? = L)_,a?T’’. As this is the 
minimal polynomial of y over E, f(T)? divides g(T). If f(T)’Xu,T' = T? + 
rT? — g, one gets u; = 0 if i # 0 mod p. Renaming u;,, as vu;, the equation 
vi aPT PLP luli? =T? + rT?-» — ¢ gives inductively that v, = sb? for some 
b, € P. Therefore, comparing the coefficients of T?’?~") on both sides, we see 
r = sv?’ for some v € P, This means that x is a p-th power in P, which is a 
contradiction. 

Therefore, P has no purely inseparable elements outside of E. 


Remarks. As a consequence of the proof, it is clear that T? + rT??~ — is the 
minimal polynomial of y over E. The extension P of £ is built up in two steps 
PD>E(x)>E with P purely inseparable of degree p over E(x) and E(x) 
separable of degree p over E. 
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1. INTRODUCTION. The evolution of field theory spans a period of about 100 
years, beginning in the early decades of the 19th century. This period also saw the 
development of the other major algebraic theories, namely group theory, ring 
theory, and linear algebra. The evolution of field theory was closely intertwined 
with that of the other three theories, as we shall see. 

Abstract field theory emerged from three concrete theories—what came to be 
known as Galois theory, algebraic number theory, and algebraic geometry. These 
were founded, and began to flourish, in the 19th century. Of some influence in the 
rise of the abstract field concept were also the theory of congruences and (British) 
symbolical algebra. The 19th century’s increased concern for rigor, generalization, 
and abstraction undoubtedly also had an impact on our story. 

In this paper we discuss the sources of field theory as well as some of the main 
events in its evolution, culminating in Steinitz’s abstract treatment of fields. 


2. GALOIS THEORY. For three millennia (until the early 19th century) algebra 
meant solving polynomial equations, mainly of degrees up to 4. Field-theoretic 
ideas are implicit even here. For example, in solving the linear equation ax + b = 0, 
the four algebraic operations come into play and hence implicitly so does the 
notion of a field. In the case of the quadratic equation ax* + bx + c = 0, its 
solutions, x = (—b + vb* — 4ac)/2a, require the adjunction of square roots to 
the field of coefficients of the equation. The concept of adjunction of an element 
to a field is fundamental in field theory. 

Field-theoretic notions appear much more prominently, even if at first still 
implicitly, in the modern theory of solvability of polynomial equations. The 
groundwork was laid by Lagrange in 1770, but the field-theoretic elements of the 
subject were introduced by Abel and Galois in the early decades of the 19th 
century. Ruffini’s 1799 proof of the insolvability of the quintic had a major gap 
because he lacked sufficient understanding of field-theoretic ideas [16]. 

Such ideas were starting points in Galois’s 1831 “Mémoire sur les conditions de 
résolubilité des équations par radicaux” [16, p. 305]: 


One can agree to regard all rational functions of a certain number of 
determined quantities a priori. For example, one can choose a particular root 
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of a whole number and regard as rational every rational function of this 
radical. When we agree to regard certain quantities as known in this manner, 
we shall say that we adjoin them to the equation to be resolved. We shall say 
that these quantities are adjoined to the equation. With these conventions, 
we Shall call rational any quantity which can be expressed as a rational 
function of the coefficients of the equation and of a certain number of 
adjoined quantities arbitrarily agreed upon:.. . One can see, moreover, that 
the properties and the difficulties of an equation can be altogether different, 
depending on what quantities are adjoined to it. 


It is clear that Galois has a good insight into the fields that we would denote today 
by F(u,,u5,...,u,), obtained by adjoining the quantities u,,u,,...,u, to the 
(field of) coefficients of an equation. In the specific example mentioned, he has in 
mind a quadratic field, O(Vd). 

Galois was the first to use the term “adjoin” in a technical sense. The notion of 
adjoining the roots of an equation to the field of coefficients is central in his work 
[9], [16]. 

One of the fundamental theorems of the subject proved by Galois is the 
Primitive Element Theorem. This says (in our terminology) that if E is the splitting 
field of a polynomial f(x) over a field F, then E = F(V) for some rational 
function V of the roots of f(x). Galois used this result to determine the Galois 
group of the equation f(x) = 0 [1], [16]. The Primitive Element Theorem was 
essential in all subsequent work in Galois theory until Artin bypassed it in the 
1930s by reformulating Galois theory, for he felt that the theorem was not intrinsic 
to the subject [9]. 


3. ALGEBRAIC NUMBER THEORY. The central field-theoretic notion here, due 
independently to Dedekind and Kronecker, is that of an algebraic number field 
Q(a), where a is an algebraic number. How did it arise? Mainly from three major 
number-theoretic problems: Fermat’s Last Theorem (FLT), reciprocity laws, and 
representation of integers by binary quadratic forms. Although all three problems 
have to do with the domain of (ordinary) integers, in order to deal with them 
effectively it was found necessary to embed them in domains of what came to be 
known as algebraic integers. The following examples illustrate the ideas involved. 


(a) To prove FLT for (say) n = 3, that is, to show that x° + y> =z? has no 
nonzero integer solutions, one factors the left side to obtain the equation 
(x + yx + yw)(x + yw?) =z°, where w is a primitive cube root of unity, w = 
(—1 + V3 i)/2. This is now an equation in the domain D = {a + bw:a,b € Z} of 
algebraic integers. This approach to FLT (for n = 3) was essentially used by Euler 
and later by Lamé and others [5]. 


(b) Gauss’s quadratic reciprocity law appeared in his Disquisitiones Arithmeticae of 
1801. It says that x* =p (mod gq) is solvable if and only if x* =q (modp) is 
solvable, unless p = g = 3 (mod 4), in which case x* = p (mod gq) is solvable if and 
only if x* = g (mod p) is not. Here p and gq are odd primes [8]. 

Gauss and others tried to extend this result to “higher” reciprocity laws. For 
example, for cubic reciprocity one asks about the relationship between the solvabil- 
ity of x° = p (mod q) and x° = q (mod p). These higher reciprocity-type problems 
are much more difficult to deal with than quadratic reciprocity. Gauss remarked 
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that [8, p. 108]: 


The previously accepted laws of arithmetic are not sufficient for the founda- 
tions of a general theory [of higher reciprocity]... Such a theory demands 
that the domain of arithmetic be endlessly enlarged. 


His comments were no idle speculation. In fact, he himself began to implement the 
above “programme” by formulating and proving a law of biquadratic reciprocity. To 
do that he extended the domain of arithmetic by introducing what came to be 
known as the gaussian integers G = {a + bi: a,b € Z}. He could not even formu- 
late such a law without introducing G [8]. 


(c) The problem of representing integers by binary quadratic forms, namely 
determining when n = ax? + bxy + cy” (a,b,c € Z), goes back to Fermat. In 
particular, Fermat asked and answered the question: which integers n are sums of 
two squares, n = x* + y’? In the Disquisitiones Gauss studied the general problem 
very thoroughly, developing a comprehensive and beautiful, but very difficult, 
theory. To gain a deeper understanding of Gauss’s theory of binary quadratic 
forms, Dedekind found that he, too, needed to extend the domain Z of integers. 
For example, even in the simple case of representing integers as sums of two 
squares, it is the equation (x + yi(x — yi) =z* rather than x? + y* =z? that 
yields conceptual insight [1], [10]. 


Dedekind’s ideas. The fundamental question in extending the domain of ordinary 
arithmetic to “higher” domains is whether such domains behave like the integers, 
namely whether they are unique factorization domains (UFDs). It is this property 
that facilitates the solution of problems (a)—(c). While the domains D and G 
introduced above are UFDs, most domains that arise in connection with the three 
number-theoretic problems we have described are not. For example, when we 
factor the left side of x” + y” =z” for n > 23, the resulting domains are never 
UFDs. To rescue unique factorization in such domains Dedekind introduced (in 
Supplement X (1871) to Dirichlet’s Vorlesungen iiber Zahlentheorie) ideals and 
prime ideals, and showed that every ideal in these domains is a unique product of 
prime ideals [10]. 

But what are the domains with restored unique factorization? To answer 
that—one of the fundamental questions of his theory—Dedekind needed to 
introduce fields, in particular algebraic number fields O(a), where a is a root of a 
polynomial with integer coefficients. These were the natural habitats of his 
domains, just as the rationals are the natural habitat of the integers. The domains 
in question were then defined as “the integers of Q(a),” namely those elements of 
Q(a) that are roots of monic polynomials with integer coefficients. Dedekind 
showed that they form a commutative ring with identity and without zero divisors 
whose field of quotients is O(a) [3], [10], [13]. 

Given Dedekind’s predisposition for abstraction—a rather rare phenomenon in 
the 1870s, he placed his theory in a broader context by giving axiomatic definitions 
of rings, fields, and ideals. Here is his definition of a field [1, p. 117]: 


By a field we will mean every infinite system of real or complex numbers so 
closed in itself and perfect that addition, subtraction, multiplication, and 
division of any two of these numbers again yields a number of the system. 
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To Dedekind, then, fields were subsets of the complex numbers, which is, of 
course, all he needed for his theory of algebraic numbers. Still, an axiomatic 
definition in number theory /algebra, even in this restricted sense, is remarkable 
for that time. Also remarkable are Dedekind’s use of infinite sets (“systems”), 
which predates Cantor’s, and his “descriptive” rather than “constructive” defini- 
tion of a mathematical object as a set of all elements of a certain kind satisfying a 
number of properties. 

The field concept was a unifying mathematical notion for Dedekind. Before his 
definition of a field he says [4, p. 131]: 


In the following paragraphs I have attempted to introduce the reader into a 

higher domain, in which algebra and number theory interconnect in the most 

intimate manner.... I became convinced that studying the algebraic relation- 

ship of numbers is most conveniently based on a concept that is directly 

connected with the simplest arithmetic properties. I had originally used the 
_ term “rational domain,” which I later changed to “field.” 


Hilbert remarked that Gauss, Dirichlet, and Jacobi had also expressed their 
amazement at the close connection between number theory and algebra, on the 
grounds that these subjects have common roots in (as Dedekind would put it) the 
theory of fields [4]. 

Dedekind produced several editions of his groundbreaking theory of ideal 
decomposition in algebraic number fields. In his mature 1894 version (4th edition 
of Dirichlet’s Zahlentheorie) he included important concepts and results on fields 
—nowadays standard—such as [9, pp. 130-132]: 


(i) If S is any subset of the complex numbers containing the rationals, the 
intersection of all fields containing S is a field; it is called “rational with 
respect to S.” 

(ii) He defines field isomorphism, calling it “permutation of the field,” as a 
mapping of a field & onto a field F' that preserves all four operations of 
the field. He observes that if F is nonzero, the mapping is one-one. He also 
notes that the mapping is the identity on Q. 

(iii) If E is a subfield of K, he defines the degree of K over E as the dimension 
of K considered as a vector space over E&. He shows that if the degree is 
finite then every element of K is algebraic over E. 


Kronecker’s ideas. Kronecker’s work was broader but much more difficult than 
Dedekind’s. He developed his ideas over several decades, beginning in the 1850s, 
trying to frame a general theory that would subsume algebraic number theory and 
algebraic geometry as special cases. In his great 1882 work Grundzige einer 
arithmetischen Theorie der algebraischen Grossen he developed algebraic number 
theory using an approach entirely different from Dedekind’s. One of his central 
concepts was also that of a field—he called it “domain of rationality,” defined as 
follows [9, p. 127]: 


The domain of rationality (R’, R’, R”,...) contains every one of those 
quantities which are rational functions of the quantities R’, R", R”,... with 


integer coefficients. 


Note how different Kronecker’s “definition” of a field is from Dedekind’s! It is a 
constructive description, rather, than the kind of definition that would be accept- 
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able to us today. But it was dictated by Kronecker’s views on the nature of 
mathematics. | 

Kronecker rejected irrational numbers as bona fide entities since they involve 
the mathematical infinite. For example, the algebraic number field Q(/2) was 
defined by Kronecker as the quotient field of the polynomial ring Q[x] relative to 
the ideal generated by x* — 2, though he would have put it in terms of congru- 
ences rather than quotient rings. These ideas-contain the germ of what came to be 
known as Kronecker’s Theorem, namely that every polynomial over a field has a 
root in some extension field [9], [13]. 

It is interesting to compare this definition of Q(V2) with Cauchy’s definition in 
the 1840s of the complex numbers as polynomials over the reals modulo x? + 1 
(and compare the latter with Gauss’s integers modulo p). Cauchy’s rationale was to 
even “algebraic” definition of complex numbers that would avoid the use of 

=<1., 


Dedekind vs. Kronecker. Dedekind and Kronecker were great contemporary alge- 
braists. Both published pathbreaking works on algebraic number theory. But their 
approaches to the subject were very different. Both were guided in their works by 
their “philosophies” of mathematics, and these too were very different [13]. 
Kronecker was perhaps the first preintuitionist, Dedekind likely the first prefor- 
malist (cf. Kronecker’s ‘God made the [positive] integers, all the rest is the work of 
man” with Dedekind’s ‘“[The natural] numbers are a free creation of the human 
mind’’). To Kronecker mathematics had to be constructive and finitary. Dedekind 
did not hesitate to use axiomatic notions and the-infinite. While Kronecker made 
frequent pronouncements on these topics, Dedekind made few; his views became 
known mainly from his works—conceptual and abstract. Some examples: 


(i) Since Kronecker’s domains of rationality had to be generated by finitely 
many elements (the R’, R’, R”,...), his definition would not admit the 
totality of algebraic numbers as a field. Dedekind had no problem in 
considering the set of all complex numbers that are roots of polynomial 
equations with integer coefficients (viz. the set of all algebraic numbers) as 
a bona fide mathematical object. 

(ii) On the other hand, Kronecker put no restriction on the nature of the 
entities R’, R", R”,...—they could, for example, be indeterminates or 
roots of algebraic equations. So Q(x) was a legitimate field to Kronecker. 
In fact, the adjunction of indeterminates to a field was a cornerstone of his 
approach to algebraic number theory. Dedekind, recall, defined his fields 
to be subsets of the complex numbers (but see Section 4). 

(iii) Since Kronecker did not accept a (say) as a legitimate number, he 
identified O(7) with Q(x) (x an indeterminate), thus claiming that tran- 
scendental numbers are indeterminate! To Dedekind Q(7) was a perfectly 
legitimate entity not requiring any assistance from Q(x). 


4. ALGEBRAIC GEOMETRY. The examples of fields we have come across so far 
have been mainly fields of numbers. Here we encounter principally fields of 
functions, in particular, algebraic functions and rational functions. The ideas are 
due mainly to Kronecker and Dedekind-Weber. 


Fields of algebraic functions. Algebraic geometry is the study of algebraic curves 
and their generalizations to higher dimensions, algebraic varieties. An algebraic 
curve is the set of roots of an algebraic function, that is, a function y = f(x) 
defined implicitly by a polynomial equation P(x, y) = 0. 
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Several approaches were used in the study of algebraic curves, notably the 
analytic, the geometric-algebraic, and the algebraic-arithmetic. In the analytic 
approach, to which Riemann (in the 1850s) was the major contributor, the main 
objects of study were algebraic functions f(w, z) = 0 of a complex variable and 
their integrals, the so-called abelian integrals. It was in this connection that 
Riemann introduced the fundamental notion, of a Riemann surface, on which 
algebraic functions become single-valued. Riemann’s methods, however, were 
nonrigorous, relying heavily on the physically obvious but mathematically question- 
able Dirichlet Principle [3], [11]. 

Dedekind and Weber, in their important 1882 paper “Theorie der algebrais- 
chen Funktionen einer Veranderlichen,” set for themselves the task of making 
Riemann’s ideas rigorous, or, as they put it [11, p. 154]: 


The purpose of the[se] investigations... is to justify the theory of algebraic 
functions of a single variable, which is one of the main achievements of 
Riemann’s creative work, from a simple as well as rigorous and completely 
general viewpoint. 


To accomplish this, they carried over to algebraic functions the ideas that Dedekind 
had earlier introduced for algebraic numbers. Specifically, just as an algebraic 
number field is a finite extension Q(a) of the field O of rational numbers, so 
Dedekind and Weber defined an algebraic function field as a finite extension 
K = C(z\(w) of the field C(z) of rational functions (in the indeterminate z). That 
is, w is a root of a polynomial p(t) = ay + a,t + a,t? + +++ +a,t", where a; € C(z) 
(we can take a, € C[z]). Thus w = f(z) is an algebraic function defined implicitly 
by the polynomial equation P(z,w) = a, + aw + a,w? + +++ +a,w” = 0. In fact, 
all the elements of K = C(z)w) = C(z, w) are algebraic functions. 

Now let A be “the integers of K”; that is, A consists of the elements of 
K = C(z\w) that are roots of monic polynomials over Cz] (cf. “the integers of 
O(a),” Section 3). By analogy with the case of algebraic numbers, here too A is an 
integral domain and every nonzero ideal of A is a unique product of prime ideals 
[1], [3]. Incidentally, the meromorphic functions on a Riemann surface form a field 
of algebraic functions, with the entire functions as their “integers.” 

Dedekind and Weber were now ready to give a rigorous, algebraic definition of 
a Riemann surface S of the algebraic function field K: It is Gin our terminology) 
the set of nontrivial discrete valuations on K. The finite points of S correspond to 
the ideals of A; to deal with points at infinity of S, they introduced the notions of 
“place” and “divisor” [3]. They developed many of Riemann’s ideas on algebraic 
functions algebraically and rigorously. In particular, they gave a rigorous algebraic 
proof of the important Riemann-Roch Theorem [1], [3], [11]. 

Dedekind and Weber were at heart algebraists. They felt that algebraic function 
theory is intrinsically an algebraic subject, hence it ought to be developed alge- 
braically. As they put it: “In this way, a well-delimited and relatively comprehen- 
sive part of the theory of algebraic functions is treated solely by means belonging 
to its own domain” [11, p. 156]. 

Beyond their technical achievements in putting major parts of Riemann’s 
algebraic function theory on solid ground, the conceptual breakthrough by 
Dedekind and Weber lay in pointing to the strong analogy between algebraic 
number fields and algebraic function fields, hence between algebraic number 
theory and algebraic geometry. This analogy proved most fruitful for both theories. 
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Another noteworthy aspect of their work was its generality, in particular its 
applicability to arbitrary fields; see [6], [15]. 


Fields of rational functions. As noted earlier, algebraic geometry is the study of 
algebraic varieties. An algebraic variety is the set of points in R” (or C”) satisfying 
a system of polynomial equations f,(x,,x,,...,x,) =0,i =1,2,..., ; the Hilbert 
basis theorem implies that finitely many equations will do. The ideal structure of 
the ring R[x,,...,x,] (or C[x,,..., x, ]) to which the polynomials f,(x,, x5,..., x,) 
belong is fundamental for the understanding of the algebraic variety, as is the 
“natural habitat” of that ring—its field of quotients R(x,,...,x,) (or 
C(x,,...,X,)). These are the fields of (formally) rational functions. We have seen 
that such fields were also introduced by Kronecker in connection with his work in 
algebraic number theory [6], [13]. 


5. CONGRUENCES. Gauss introduced the congruence notation in the Disquisi- 
tiones Arithmeticae of 1801 and showed (among other things) that one can add, 
subtract, multiply, and divide congruences modulo a prime p, in effect that the 
integers modulo p form a field—a finite field of p elements. Inspired by Gauss’s 
work on congruences, Galois introduced finite fields with p” elements in an 1830 
paper entitled “Sur la theorie des nombres.” 

Galois’s aim was to study the congruence F(x) = 0 (mod p) as a generalization 
of Gauss’s quadratic congruences (cf. Gauss’s quadratic reciprocity law). Here F(x) 
is a polynomial of degree n that is irreducible mod p, i.e., F(x) is irreducible over 
the field Z,. Galois showed that F (x) has no integral roots [mod p]. His conclu- 
sion was that [7, pp. 277-278]: 


One should therefore regard the roots of this congruence as some kind of 
imaginary symbols..., symbols whose employment in calculation will often 
prove as useful as that of the imaginary v— 1 in ordinary analysis. 


He continues: 


Let i [an arbitrary symbol, not the complex number i] denote one of the 
roots of the congruence F(x) = 0, which can be supposed to have degree n. 
Consider the general expression 


atajita,i? t+: +a,_,i""'," (**) 


where 4,4,,45,...,a,_, represent integers [mod p]. When these numbers 
are assigned all their possible values, expression (**) takes on p” values, 
which possess, as I shall demonstrate, the same properties as the natural 
numbers in the theory of residues of powers. 


Galois did, indeed, show that the expressions (**) form a field, now called a Galois 
field. He also showed that (in our terminology) the multiplicative group of that 
field is cyclic [1], [7], [13]. In an 1893 paper entitled “A doubly-infinite system of 
simple groups,” E. H. Moore characterized the finite fields [12]. 


6. SYMBOLICAL ALGEBRA. In the third and fourth decades of the 19th century 
British mathematicians, notably Peacock, Gregory, and De Morgan, created what 
came to be known as symbolical algebra. Their aim was to set algebra—to them 
this meant the laws of operation with numbers, negative numbers especially—on 
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an equal footing with geometry by providing it with logical justification. They did 
this by distinguishing between arithmetical algebra—laws of operation with positive 
numbers, and symbolical algebra—a subject newly created by Peacock, which dealt 
with laws of operation with numbers in general. 

_ Although the laws were carried over verbatim from those of arithmetical 
algebra, in accordance with the so-called Principle of Permanence of Equivalent 
Forms, the point of view was remarkably modern. Witness Peacock’s definition of 
symbolical algebra, given in his Treatise of Algebra of 1830 [14, p. 35]: 


The science which treats of the combinations of arbitrary signs and symbols 
by means of defined though arbitrary laws. 


Quite a statement for the early 19th century! Such sentiments were about a 
century ahead of their time. And of course one did have to wait about a century to 
have what Peacock had preached put fully into practice. Nevertheless, the creation 
of symbolical algebra was a significant development, even if not directly related to 
fields, signalling (according to some) the birth of abstract algebra [2]. 
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problems address on the inside front cover. Submitted problems should include 
solutions and relevant references. Submitted solutions should arrive at that ad- 
dress before February 29, 2000; Additional information, such as generalizations 


and references, is welcome. The problem number and the solver’s name and ad- 
dress should appear on each solution. An acknowledgement will be sent only if a 
mailing label 1s provided. An asterisk (*) after the number of a problem or a part 
of a problem indicates that no solution 1s currently available. 


Problem 10743 [1999; 586] in the June-July 1999 issue was misstated. Here is the corrected 
version. 

10743. Proposed by Calin Popescu, Université Catholique de Louvain, Louvain-La-Neuve, 
Belgium. Let R = )\(—1)'("), where the sum is taken over all i € {0, 1,..., — 1} such 
that i + 1 is a quadratic residue modulo p, and let N = )-(—1)/ (“), where the sum is taken 
over all 7 € {0, 1, ...,2 — 1} such that j + 1 is a quadratic nonresidue modulo p. Prove that 
exactly one of R and N is divisible by p. 


PROBLEMS 


10746. Proposed by Stepan Tersian, University of Rousse, Rousse, Bulgaria. Prove that 


i (over = ey) eal 
0 


for all positive real numbers x and y. 


10747. Proposed by Athanasios Kalakos, Athens, Greece. Find all differentiable functions 
f : K — R that are twice differentiable on an open interval containing 0, have exactly one 
real root, satisfy f(1) = 1, and satisfy f’(f(t)) = 2 f(t) for every t € R. 


10748. Proposed by Itshak Borosh, Douglas A. Hensley, and Joel Zinn, Texas A& M 
University, College Station, TX. Let p and q be prime numbers, and let r be a positive 
integer such that g|(p — 1), q fr, and p > r7—!. Show that for any integers a], 42,...,4r, 


if iat apne = 0 mod p, then | ];_, aj = 0 mod p. 


10749. Proposed by Alain Grigis, Université Paris 13, Villetaneuse, France. Let ABC be 
a triangle with a right angle at B and an angle of 1/6 at A. Consider a billiard path in the 
triangle that begins at A, reflects successively off side BC at P, off side AC at Q, off side 
AB at R, off side AC at S, and then ends at B. 

(a) Show that AP, QR, and SB are concurrent at a point X. 

(b) Show that the angles formed at X measure z//3. 

(c) Show that AX = XP + PQ4+ QX =XR+RS4+SX =2XB. 
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10750. Proposed by Leonard Smiley, University of Alaska, Anchorage, AK. For a positive 
integer m, express )-P-_,(n/ gcd(m, n))x" as a rational function of x. 


10751. Proposed by Emeric Deutsch, Polytechnic University, Brooklyn, NY. Let n be a posi- 
tive integér, and let S,, be the set of all strings a az - - - a, of positive integers satisfying aj = 1 
and aj4+; — a; € {1, —1, —3, —5,...}. For example, S5 = {12345, 12343, 12341, 12323, 
12321, 12123, 12121}. Find |S,|. 


10752. Proposed by Gh. Costovici, Technical University "Gh. Asachi", Iasi, Romania. For 
n €N, let a, and b, be complex numbers, with each b, 4 0. Let s, = ay +a2+---+ay, 
and let t) = (1 — bj /bn41) a1 + (1 — b2/bn41) a2 +--+ + 1 — bn /bn41) an. 

(a) Prove that if limy—so9 Dn41/bn = 1 and acer |Sn —t,|? converges for some g € (0, 1], 
then }-° | an converges. 

(b) Prove that if baer lbn41/bn — 1|’ and s| Kea 9 converge for some 
r € (1, 00), then )~P° | a, converges. 


SOLUTIONS 


A Zeta Function over a Recurrent Sequence 


10486 [1995, 841]. Proposed by Joseph H. Silverman, Brown University, Providence, RI. 
Let a,b > O anda > 1 be real numbers, and define Z(s) = >°,,-7 (aa" + ba") * for 
complex numbers s with positive real part. 

(a) Prove that Z(s) has a meromorphic continuation to all of C. 

(b) Find the poles of Z(s). 

(c) Find the residues of Z(s) at its poles. 


Solution I by David Bradley, University of Maine, Orono, ME. Let o be the real part of s. 
Write 


Z(s) = (a+b) > + >> (aa"+ba") + S° (ba" + aa"). (1) 
n=] n=1 


Without loss of generality, assume that 0 < a < b. We first consider the case |a| > ./b/a. 
We then have the two binomial expansions 


(oat + bamt—8 ae tg ¥ Be ak + Em n(s)) (2) 
(1+ ba-la-2)* Er, mae 
and 
ae baa" m=! 7_»\ ak 
n —n\~S — p-Spy—"s —2nk 
(box + aa ) = (1+ abla" Pab-la-2)° ba . ( k \Sa + Fy «0) (3) 


where m is a fixed positive integer and Emn(s) = O(a~*”"") and Fin.n(s) = O(a7~2™"), 
Since |a| > ./b/a, it follows from (1)—(3) that 


k ak oe) 
roy ntero-r+ $2) (tet Ba) Sects ora) 


n=] 
—s—k pk —s—k ak 
s —s\ a" "bt +b —n(o+2m) 
= (a+b) +> \ aa +e Ye ee (4) 
A= 
Since Em n(s) and Fin.n(s) are analytic foro > —2m, it follows by analytic continuation 


that (4) is valid foro > —2m. Since m is an arbitrary positive integer, we conclude that 
Z(s) has a meromorphic continuation to the entire complex plane. 
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We now calculate the poles and residues from (4). For integer n, let s, = 27in/loga. 
From (4), we see that all singularities of Z(s) are simple poles, and these occur at points of 
the form s, — 2k, where n is an integer and k is a nonnegative integer. The residue of Z(s) 


at the pole s, — 2k is (*7°")a* bk (a- + b-™) /loga. 

If (b/a)*” is an odd power of a, then s, — 2k is an ordinary point, not a pole. Finally, 
although our derivation of (4) is valid only for complex numbers a with |w| > ./b/a, this 
restriction can be eased to |a| > 1 by analytic continuation in a, provided that m remains 
fixed. 

Solution II by Donald A. Darling, Newport Beach, CA. With B = loga, y = 5 log(a/b), 
and 5 = /ab, the function Z(s) takes the form 
oS | 


1 oh 
(25)* 4, cosh" (nB + y) 


Since the real part of s is positive, the function cosh~* (nf + y) satisfies the hypotheses of 
the Poisson summation formula }°°?_., f(n) = op gn oe f (x) cos(27nx)dx. Thus, 


= 1 
“, cosh® (nB + y) 


ah cos(27nx) ca oS [ cos (27n(y — y/B)) F 
oo cosh* (Bx + y) = ged URS cosh* (By) 


2mny on cos(2mny) 2mny °° sin(27ny) 
cos ( p I no cosh’ (By) ay+ Do sin (= yf cosh (By) 
Cos (=) f er); (5) 


f= p ~oo cosh” (By) 


Formula 3.985(1) (p. 540) of I. S. Gradshteyn and I. M. Ryzhik, Table of Integrals, Series 
and Products, Academic Press, 1980 states 


[ cos(27ny) pre (; =) (5 mn) 
eye rs oS 
~oo cosh" (By) BY(s) \2 B 2 £6 


With (5), this yields 


ites SG ee i 
20) = saresge De 008 B )rG+ B yr ) 


_ T7(s/2) in 1 5 eas any : sain : Ss min | 
2pT(s)5* | BI (s)os E 2° Bp ae 
By Stirling’s formula for the gamma function in a vertical strip, the series in the last term of 
(6) converges uniformly in any vertical strip bounded away from the poles of the summands. 


Thus, Z(s) has an analytic continuation to the entire complex s-plane, and the poles and 
residues may be calculated as in the first solution. 


Editorial comment. When |a| > ./b/a, we can let m — oo in (4) to obtain 


—s\ q75 kpk 1 pos k ik 
Z(s) =(a+b)~ oe \ aaa | 


which is valid for all complex s. 
When a = b = 1/2 anda = e”* with the real part of z positive, (6) yields Z(s) = 
yop COSh” *(nzz). For positive integer s, such sums arise in the theory of elliptic 
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functions. For even s (C. B. Ling, On summation of series of hyperbolic functions, SIAM J. 
Math. Anal. 5 (1974) 551-561) and for all positive integral s (I. J. Zucker, The summation 
of series of hyperbolic functions, SIAM J. Math. Anal. 10 (1979) 192-206), Z(s) can be 
evaluated in terms of elliptic functions. In particular, fors = 1,0 < k < 1, andz = 
K(V1 —k?)/K(k), we have °° _., cosh"! (nz) = (0 en *2)2 — (2/7) K(k), 
where K (k) is the complete elliptic integral of the first kind (see B.C. Berndt, Ramanujan’s 
Notebooks, Part II, Springer-Verlag, 1991, p. 102 and p. 138). 


Solved also by D. Cantor, R. J. Chapman (U. K.), R. Holzsager, and the proposer. 


A Matrix of Inequalities 


10599 [1997, 566]. Proposed by Fred Galvin, University of Kansas, Lawrence, KS. Let 
X1,.+.+,;Xm and yj,..., Yn be nonnegative numbers and let (a;;) be an m x n matrix of 
nonnegative numbers with at least one nonzero entry in each row. Suppose that the inequality 
pen AhjXh < dpe Aik Yk holds whenever a;; > 0. Show that )°y", x; < ae, yj. 


Solution by Frank Jelen and Eberhard Triesch, Der Rheinisch-Westfdlischen Technischen 


Hochschule, Aachen, Germany. Let A be the specified matrix, with columns cj, ..., Cy. Let 
x = (X1,...,Xm)? and y = (y1,..., yn), and let 1, denote the column vector of length k 
with entries equal to 1. 
Define b = (b,..., bm)! by bj = max({c7 x: aij > O}; this is well-defined since each 
row contains a positive entry. Consider the linear programs 
minimize 147 — subjectto Az >bandz>0 (1) 
and 
maximize b’ w subject to A’ w <1, and w > 0. (2) 


These linear programs are duals of each other, and (1) has the feasible solution z = y. It 
thus suffices to show that there exists a feasible solution u of (2) with b? u > 1 x, since the 
Duality Theorem then yields its y>blu >t x. 

Consider the nonnegative vector u = (uj,...,Um)! defined by uj = x;/b; if bj > 0 
and u; = 0 otherwise. Clearly b’u = 17 x. | 

For 1 < j <n, define J; = {i: ajj > Oand x; > O}. Fori € Jj, we have bj > cf 
Feasibility of u now follows from 


Cj fu= ayn = Davy = r = aa 


tel; a ie tel; 


x > 0. 


Solved also by the proposer. 


A Complex Determinant 


10601 [1997, 566]. Proposed by Wen-Xiu Ma, Universitdt-GH Paderborn, Paderborn, 


Germany. Letn > 1 be an integer and let a), a2, ..., a, be complex numbers. Show that 
1 ay ate: ae 
1 a, ay oe aes 
) 2n—1 
Gn SG. 8 an (n—1)/2 4 
= (-1)" (aj — aj)". 
2n—2 J 
1 2a, +++ (Q2n—1)a;"~ A 
1 2a, +++ (2n- a2 rs 
O 1 2a, ++» (Qn —1)a2"-4 
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Solution I by Robin J. Chapman, University of Exeter, Exeter, UK. Consider ie yaneer 
monde matrix for 2m complex numbers a), ..., @2,, in which the 7, j-entry is a, . The 
determinant is |] <j <p<2n(@k — @j). Subtracting row j from row n + j turns rown + j 
into 
2n—2 
2n—2- 
(Qn+.j — 4j)| 9,1, dn4j +4j,°°-, 2, Gna jj 


These row operations do not change the determinant. When ay+; 4 a; for each j, we may 
cancel TTj=1 (dn+4j; — a;) from the two expressions for the determinant to obtain 


1 a at vee laa 
1 a as vee re 
An a? — a=" 
2n— i 2n—2—r = IT (ax — aj). (*) 
QO 1 Gayytai = Diy a, ai L<j<k<2n 
kfén+j 
2n—2 (2n—2-r 
1 an4+2 + 42 De 0 442 a4 


2n—2 ; —2— 
0 1 Q2n + an one ar a "al 


n 


By continuity, (*) is also valid when an+; = a;. Setting aj = an+; for each j in (*) yields 
the desired result, since each difference a, — a; for j < k appears four times in the product 
on the right side of (*), once in reverse order. 


Solution II by Joseph J. Rushanan, The MITRE Corporation, Bedford, MA. We use the 
techniques from J. J. Rushanan, On the Vandermonde matrix, this MonTHLy 96 (1989) 921- 
924. Let A be the matrix whose determinant is given in the problem statement. Given a 


complex polynomial f defined by f(z) = paar c;z', let f =[co,..-, Con—1]?. Then 


= [F(a1), F(@2),---5 Gn), Fas f'@), 5 Fn] 


Let f(z) = Th (z —a;) for1 < k <n, and let f(z) = fe—-n(z) []j_) (2 — a) for 
n+1< k < 2n. Since f, is monic with degree k — 1, the matrix U = [f, ---f,] is 
upper-triangular with 1s on the diagonal. Furthermore, L = AU is lower-triangular, since 
fx(aj) =0= Fax (aj) ifl <j <k <nand fnix(a;) = 0 for all 7. 

Thus det A is the product of the diagonal terms of L, which are f,(a,) and f, 44 (4k) for 
1 <k <n. These terms consist only of factors of the form (a; — a;) with r #4 5. A typical 
term (as; — a;) with r < s appears in f;(as) once, appears negated in f, 4,(ay), and appears 
squared in f/,4,(as). This shows that A has the desired determinant. 

The technique generalizes to higher derivatives. 


Editorial comment. David Callan and Wai Wah Lau observed that generalizations involving 
higher derivatives have appeared in the literature, such as on page 400 of R. A. Horn and 
C. R. Johnson, Topics in Matrix Algebra, Cambridge Univ. Press, 1991. Several others 
noted that the formula holds for a1, ..., a, in an arbitrary commutative ring. Indeed, every 
polynomial identity in Z[a,, ..., a] holds over arbitrary commutative rings. 


Solved also by M. Benedicty, J. C. Binz (Switzerland), G. L. Body (U. K.), D. Callan, L. L. Foster, J.-P. Grivaux (France), R. 
Holzsager, G. Keselman, N. Komanda, O. Kouba (Syria), W. W Lau, J. H. Lindsey II, G. R. Miller, M. McKee, J. H. Nieto 
(Venezuela), G. Peng, C. Popescu (Belgium), R. Richberg (Germany), J. H. Smith, P. Szeptycki, A. Tissier (France), J. Van hamme 
(Belgium), Wyoming Problems Circle, WMC Problems Group, and the proposer. 
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On a Convolution of Eulerian Numbers 


10609 [1997, 664]. Proposed by Donald E. Knuth, Stanford University, Stanford, CA. 
Let a(l,m,n) = Ss Cyd +m — k)"-*(k — 1*. Prove that )“7_,a(l,m,n) = 
((m +n + 1)/2)a(n, m,n) — ((m + 1)/2)m" 

Solution by David Callan, University of Wisconsin, Madison, WI. We compare coefficients 
of m/ to prove the desired identity. First we express a(/, m, n) using Eulerian numbers. The 


classical Eulerian number is the number of permutations of [n] = {1, ..., n} consisting 
of k ascending runs (with descents at k — 1 locations). These are counted by placing 
n numbered objects into kK numbered boxes to avoid properties P;,..., Py, where P; is 


the property that box i is empty or has only objects greater than those in the preceding 
box. A careful application of the inclusion-exclusion principle yields the formula ) = 


yt (1 ("*)& — fy". Note that (5 = 

The definition of (7 } yields ie ate } = n!. We also need reo k (7 = 5(n +1)!. To 
prove this combinatorially, we alter each permutation of [n] with k runs by placing n + | 
at the end of one run. This can be done in k ways and yields a permutation of [n + 1]. A 
permutation of [n + 1] arises in this way if + 1 is at the end, not if n + 1 is at the start, and 


otherwise if and only if the element preceding n + 1 is greater than the element following 
it. Thus we obtain half the permutations of [n + 1]. 


We claim that 
a(l,m,n) = Er); " m!, (*) 


j=0 i=0 


which we prove by comparing coefficients of m/. By applying the binomial theorem to 
(1—k+m)"—*, we extract the desired coefficient as ),_)(—1) (9) ce) (lI —k)"~/. Rear- 


ranging binomial coefficients converts this to (“) ye) (om )( —k)"~/. After can- 
celing the ("), it remains only to rewrite the sum as weer ee We use our formula for Eu- 


h(nt+l 
lerian numbers, let k = i — h, apply the elementary identity )*,_9(—1)"(";_) = (-1)’ ("), 
and finally one k with 1 — k to obtain 


(77) a e/a cur Sesame) 


= i=0 h= k=0 


=yov'(" jer = y- De" Vank mynd. 
k=0 


Using (*), we compute coefficients of m/ in the desired identity. For j = n + 1, the 


contributions cancel. For j = n, the coefficient is n. For j <n, the coefficient of m/ in 
721 41, m, n) divided by (") is 


nl . l n . n : n ; 
oe = Lert-aL("))=e+n eb - rT] 
=1j=1' * i=l ix1\ ! i=1\ ! i=l : 
(ai ae fe a 

ap 

For j <n, the coefficient of m/ in ((m +n+ 1)/2)a(n, m,n) — ((m + 1)/2)m" is 


SQ aE) 


_nt+! 1 _(n ~aAtjt+i 
5 ("Jon -p+5(," Jo-i+pi=(“)o- ptt 
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1 
=(nt+ I j)!-S@-jt+D!=@-j) 


Editorial comment. From (*) we infer that a(/, 1, 1) is the total number of arrangements 
with at most / ascending runs that can be formed from subsets of [n]. 


Solved also by R. J. Chapman (U. K.), Q. Darwish (Oman), H.-J. Seiffert (Germany), and the proposer. 


A Variation on Additive Bases 


10610 [1997, 664]. Proposed by Richard Hall, University of Portsmouth, Portsmouth, 
England. Given a positive integer m, let C(m) bé the greatest positive integer k such that, 
for some set S of m integers, every integer from | to k belongs to S or is a sum of two not 
necessarily distinct elements of S. For example, C(3) = 8 with S = {1, 3, 4}. 

(a) Show that, for all e > 0, 1/4 < C(m)/m* < 1/2 + € for all sufficiently large m. 

(b)* Improve the asymptotic bounds in part (a). 


Solution to (a) by the National Security Agency Problems Group, Fort Meade, MD. Let 
[n]/ denote the first n positive multiples of 1. When m is even, with m = 2t, let S = 
[t —1]1 U[t+ 1]t. Since S has size m and represents all positive integers up to (t+ 1)t +1, 
we have C(m) > t? + 2t. Thus C(m)/m? > (t® + 2t)/(2t)? > 1/4. 

When m is odd, with m = 2t + 1, let S = [t]1+ [t+ 1](@+ 1). Since S has size m and 
represents all positive integers up to (t + 1)? ++1+1, we have C (m) > (t+1)(¢+2). Thus 
C(m)/m? > (t + 1)(t + 2)/(2t +1)? > 1/4. 

A set of size m represents at most 2m+ () integers. Hence C(m)/m* < 1/2+3/(2m) < 
1/2+ € form > 3/(2e). 


Solution to (b) by the GCHQ Problems Group, Cheltenham, UK. We show that 9/32 < 
C(m)/m? < 4/9 + « for all sufficiently large m. 

For the lower bound, we construct a set that represents many integers by spreading the 
summands apart more quickly than in (a). Write m as 16i + j, where —7 < j < 8, 
and let A = [1,37], B = [2,71 + j]3i, C = (i+ j)3i + [1, 3i]Gi + 1), and D = 
(7i + j)6i + 6i + [0, 3i], where [x, y] ={n € Z: x <n < y}. LetS=AUBUCUD. 

From A and A+ A we get [1, 67], from A + B we get [61 + 1, (7i + j)3i + 37], and 
from C and A+ C we get [(7i + j)3i + 3i + 1, 3017 + 3i(j + 2)]. 

“Forr € [1,4i + j] ands € [1, 3i], we have (3i + r —s + 1)3i € B and (7i + j)3i + 
s(3i+1) € C, and the sum of these integers is (10i +r + j+1)3i+s. Thus B+C contains 
(110i +2+j)3i+1, (141+2j+1)3i+37], which equals [30i*+3i(j+2)+1, (7i+j)6i+6i]. 
Furthermore, DU (A + D)U(B+ D) = [(7i+ j)6i + 61, (7i + j)9i1 + 91], and C + D= 
(Tit f)9i +91 +1, (i+ f)9i+31(3i+1) +91] = [((7i+ f)9i +97 +1, 7217 +1(124+-9/)]. 

Since |A U BUC U D| = ™, for large enough i we have 


Cm) | T2i7+i012+9j) _ 9 BI +iG 44/3) _ 9 
m: — 256124 32ij + j2 32812 +ij + 72/32 32° 

To prove that C(m)/m? < 4/9 + €, we show that some of the m + (3) pairs must be 
“wasted”. This happens in two ways. First, the sum may be too big, as happens for any 
pair of numbers that both exceed C(m)/2. Second, note that r — s = t — u if and only if 
r+u=t-+s. Thus we obtain a wasted pair for each instance of identical differences. 

Consider a set S that represents everything from 1 to m7, for some > 1/4. We may 
assume that S C [1], um). Let am = |S MN (1, pum? /2]|. All pairs from the (1 — a)m 
numbers above jzm*/2 are wasted. The smaller pairs have differences between 1 and 
jum? /2 — 1, yielding wastage when am + (“3") > um*/2 — 1. 

Let a > b mean that a > b — € for large enough m. Letting wm? be the number of 
wasted pairs, we have w > max(0, (a* — 1) /2a+d—- a)? /2. Letting f(a) denote this 
lower bound, we have f(a) = —(1 — a) for a* < wand f(a) = 2a — 1 fora* > pw. The 
first quantity is negative and the second positive, since 4 > 1/4. Thus w is minimized at 
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* = pw, and hence w > (1 — Jp)? /2. Excluding the wasted sums yields uw < 1/2 — w, 
and so2u <1—(1—./f)? =2./ma — mw. Thus ./p =< 2/3 and p < 4/9. 


Editorial comment. John Lindsey proved that C(m)/m* < .499785077 + €. Kevin Ford 
proved that C(m)/m? < .48832 + € and conjectured that the number on the right can be 
replaced by 1/8 < .393. 


Solved also by R. J. Chapman, K. Ford, J. H. Lindsey IJ, and the proposer, 


An Identity Involving Rooted Trees 


10615 [1997, 767]. Proposed by Joaquin Gomez Rey, Alcorcén, Madrid, Spain. For na 
positive integer, evaluate 


no G—Dki 


So (ki tha te + kn)! I] EDGE 


where the summation runs over all n-tuples (k, k2,..., k,) of nonnegative integers such 
that k; + 2ko +---+nk, =n. 


Solution I by Anchorage Math Solutions Group, University of Alaska, Anchorage, AK. We 
show that the given expression a, equals n”/n!. 

Let f, g, h be exponential generating functions with coefficients f,, 2n, hn, respectively. 
When f(g(x)) = h(x), expanding the composition and collecting terms appropriately 
yields 


8h, ‘ 
h(x) = yn (> 4- LT, Tae one) mm 


where the innermost sum is over partitions A of n that have k; parts of size i for each i and 
I parts all told. When f, =n! and gn =n"7!, this yields h, = n!dp. 

To see that h, = n”, we use g(x) = xe®™), a well-known consequence of the La- 
grange Inversion Formula or of combinatorial manipulation of the exponential generat- 
ing function for rooted trees. Since f(x) = 1/(1 — x) as a formal series, h(x) = 
1/(1 — g(x)). On the other hand, }° n"x"/n! = 14+ xg'(x). Thus it suffices to show that 
(1+ xg’(x))(1 — g(x)) = 1. This follows from computing g’(x), which yields xg’(x) = 


g(x) + xg'(x)g(x). 


Solution IIT by GCHQ Problems Group, Cheltenham, U.K. We show that the sum equals 
n" /n! by counting a set of size n” in another way, obtaining n! times the desired sum. By 
Cayley’s Formula, the set consisting of rooted trees on the vertex set {1,...,”} with one 
vertex marked has size n”. 

Alternatively, we view each such tree as a list of the subtrees that remain after deleting 
the edges on the path from the root to the marked vertex. First we partition the n points into 
subsets, using kj subsets of size i for 1 < i <n. For each n-tuple (k),...,k,), there are 
n\/ (T]; i! k;!) ways to do this is. We multiply this by ()~k;)! and by |], i'~!)“' to order 
the subsets and to place a rooted tree on the elements of each subset. When we assemble the 
trees, their roots in order form the path from the root to the marked vertex in the full tree. 
Since we obtain each rooted tree with marked vertex exactly once, we have proved that 


n! ; ek 
Dy Tyan 2) LE Ds =n", 


Editorial comment. The GCHQ Problems Group noted that omitting the factor ()° kj)! 
yields unordered rooted forests on the n points, which correspond to rooted trees with n + 1 
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labeled points where point n + 1 is constrained to be the root. Thus this approach also yields 


the identity a 
Gays ae ly 
a I] iMiki! ont 
l 


geeey 


Many solvers reduced the evaluation of the coefficient h, in Solution 1 to the convolution 

tao (,)k*(n — ky"-*-! and obtained n” for the sum by manipulating classical identites. 
The same formula is derived in F. Bergeron, G. Labelle, and P. Leroux, Combinatorial 
Species and Tree-like Structures, Cambridge Univ. Press, 1998, as an application of the 
pointing operation on the species of vertebrates, but it may be proved directly using the 
objects in Solution 2: Count the rooted trees with marked vertex having n — k vertices in 
the marked subtree. When k = 0, the root is the marked vertex, and n”~! counts the rooted 
trees. When k > 0, choose a set S of k vertices, a rooted tree with marked vertex on S, and 
a rooted tree on the remaining vertices. Let the marked vertex of the first tree be the parent 
of the root x in the second tree and view x as the marked vertex in the full tree. 


Solved also by R. Bagby, N. Bansal (India), D. Beckwith, D. Callan, R. J. Chapman (U.K.), W. Chu (France), J. H. Lindsey II, 
H.-J. Seiffert (Germany), L. Takacs, D. Zeilberger, NSA Problems Group, and the proposer. 


Divisors of Sums of Divisors 


10617 [1997, 767]. Proposed by James G. Merickel, Philadelphia, PA. For a positive integer 
N, a(N) denotes the sum of the positive divisors of N. Given a positive integer n and a 
prime p, prove that there exist arbitrarily large sets S of multiples of n with the following 
property: For some positive integer m, the fraction o(N)/N reduces to a fraction whose 
denominator is p™ for every N é€ S. 


Solution by John P. Robertson, Berwyn, PA. Factor n as p*v with v relatively prime to p. 
Similarly factor o(v) as p’ w with w relatively prime to p. We first consider the case when 
p 4 2. By Dirichlet’s Theorem, there are arbitrarily large sets of primes not congruent to 
—1 modulo p. Let Q be such a set not containing p or any primes that divide v. Let a be 
the product of the primes in Q. 

Let u = d(av(p — 1)), where ¢ denotes Euler’s ¢-function. Let r = ku — 1 withk 
large enough so that r > max{s, t}, and let m = r —t. For each positive divisor b of a, the 
number N = up’ b is divisible by n because r > s. There are 2!2! such divisors b and hence 
there are 2!2! such N. Thus we need to show only that each fraction 0 (N)/N, reduced, has 
denominator p”™. 

Note that v, p’, and b are pairwise relatively prime. Hence o(N) is o(v)a0(p’)a(b). 
Now o(p’) is (p’t! — 1)/(p—1). Because rr + 1 is a multiple of u and p is relatively prime 
to av(p — 1), we find that (pt! — 1)/(bv(p — 1)) is an integer that is not divisible by p. 
- Also p does not divide the integer w = a (v)/p’, and p does not divide o(b), because no 
prime in Q is congruent to —1 modulo p. Thus 


s(n) |e@)/P'|| et! — D/Ge(p - ))][o®] 
ee a 
where each item in brackets is an integer that is not divisible by p. 
For the case p = 2, we repeat the argument with the following changes: Let Q be any 
set of odd primes that do not divide v. Use a? in place of a and b? in place of b, set 


u = d(a?v(p — 1)), and set N = up"b?. Since a (b”) is odd, the argument proceeds in the 
same way. 


Solved also by GCHQ Problems Group and the proposer. 
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Edited by Harold P. Boas 
Mathematics Department, Texas A & M University, College Station, TX 77843-3368 


My Brain Is Open: The Mathematical Journeys of Paul Erdos. By Bruce Schechter. 


Simon& Schuster, 1998, 224 pp., $25. 
The Man Who Loved Only Numbers: The Story of Paul Erdés and the Search for 
Mathematical Truth. By Paul Hoffman. Hyperion, 1998, 289 pp., $22.95. 


Reviewed by Albert A. Mullin 


Trying to understand the late, great Paul Erdds (1913-1996) is like trying to write 
a beautiful palindromic sonnet! Erdés was a genius wrapped in a mathematician 
inside an eccentric. A great problem poser and problem solver, he could see 
analogies between analogies. Personally independent in spite of his gregarious 
nature, he had an almost fanatical love of justice, partly due to some bad 
experiences in his own life. He always had a high regard for every form of 
intellectual aspiration and spiritual effort. 

Like Mersenne, Erdos greatly advanced mathematics by extensively publicizing 
and propagating results and conjectures. Like Euler, Erdés published prodigiously 
and maintained his full intellectual faculties until the very day of his death. Since 
posthumous articles are still appearing, his complete works may reach a total of 
1500 items. Like his fellow countryman John von Neumann, Erdos had a vivid 
interest in people. On the other hand, few mathematicians delighted in gossip like 
Johnny—certainly not Erdés. Like Godel, Erdés had no formal students, although 
he wrote joint papers with nearly 500 different collaborators. Erdos had neither 
the “genius’ ego” nor the “actor’s ego,” either of which can be disastrous for 
cooperation. Both Godel and Erdos will likely have an endless number of students 
of their works. 

Much has been made of Erdos’ eccentricity; his peripatetic lifestyle; and his 
bantering use of terms such as “epsilon,” “boss,” and “slave” (“child,” “wife,” and 
“husband’’), abbreviations such as P.G.O.M. (Poor Great Old Man) and S.F. (God, 
the Supreme Fascist), and metaphors such as THE BOOK (the S.F.’s transfinite 
volume containing the most elegant proofs of all theorems). Earlier this century, 
James Joyce made this whimsical, elliptical way of communicating intellectually 
respectable in his last two novels, Ulysses and Finnegans Wake. Erdos’ eccentricity 
was often amusing and always harmless: sufficiently spectacular to provoke com- 
ment, but never serious enough to constitute disintegration. Edmund Landau 
summarized the situation well when he said, ‘““Wir Mathematiker sind alle ein 
biBchen meschugge” (““We mathematicians are all a bit wacky’). 

There are numerous tales about Erd6s; remarkably, many are true. Erdés 
himself would tell jokes in which he figured, but on close inspection, the jokes tell 
as much about others as they do about Erdos. For example, there is the neat story 
about Erdos traveling around the world by airplane, as he did so often. An epsilon 
is sitting between Erdos and the boy’s mother. The boy is studying plane geometry, 
but he is puzzled by a certain result. Always looking for new talent, Erdds politely 
offers to help, but the boy refuses. The boy’s mother scolds, “Listen to the man!” 
But the boy cries out to his mother, “What could he know about geometry?” Then 
there is the story about an old mathematician who declared that he had been 
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fortunate to have a wonderful life: he had proved elegant theorems and he had 
met many famous people. But he regretted never meeting Einstein. Erdés mut- 
tered something like, “Fear not, soon you will!” Erd6és could joke positively about 
many situations that really were quite negative when viewed deeply enough. He 
enjoyed his job at Notre Dame: “Who could dislike a place with plus signs 
everywhere?” 

In his later years, Erdés was constantly on the move. In Erdosian verse: 
“Another roof, another proof.” His lifestyle after 1971 came straight from Keats: 
“What mad pursuit? What struggle to escape?” Erddés never married, perhaps 
because he preferred truth to beauty. He moved about freely and often, spreading 
the mathematical Word everywhere. He never sought “glory” because he realized 
it was a little like the elegant bed of Louis XIV: namely, magnificent, but full of 
bugs! Erdos lived slightly longer than Hilbert, but slightly shorter than Dedekind. 
His own wry comment on getting old was: “I am becoming stupider no more.” 

One must ask several questions about Erdos’ super-productive life. How could 
he settle the Prime Number Theorem so easily without recourse to complex 
function theory? What prepared him for the necessary insights? Why didn’t he 
settle the Riemann Hypothesis? Why didn’t he settle Fermat’s Last Theorem in an 
elementary way? I doubt that THE BOOK contains any 200-plus page proof of 
Fermat’s Last Theorem, nor do I expect a short proof to be in the margin. Surely 
his deep conviction of the important role of discreteness guided Erdos away from 
various problems. His use of probabilistic methods in graph theory and Ramsey 
theory will long be remembered, but why didn’t he find a constructive probabilistic 
method? Finally, one must be very puzzled that Erd6s was rejected for work on the 
Manhattan Project at Los Alamos, while Klaus Fuchs was accepted. However, 
sometimes rejection is a great blessing in disguise. 

Erd6s’ motto was: conjectures and proofs! His main interest was with the most 
primitive and basic objects of mathematics: the positive integers. Of course, he was 
interested in other areas of mathematics too: number theory; combinatorics (finite 
and transfinite); combinatorial geometry; set theory, but not logic; analysis; and 
probability theory, with special emphasis on probabilistic methods for the existence 
of formal objects. His approach to the investigation of this broad range of topics 
was uniform: unleash a barrage of cogent challenges. Then he would politely cajole 
people into working on them, almost always successfully. Most mathematicians are 
good at either conjecture or proof, but Erdés was great at both! 

Here we have a pair of well written and sympathetic books (each authored by a 
non-mathematician) on Erddés’ life and works. There is considerable overlap in 
topics between the books, and I could find no meaningful contradiction of views 
between them. Nevertheless, the two books do have different flavors and ap- 
proaches. I found My Brain Is Open slightly more mathematical and more willing 
to go into details of technical results. On the other hand, I found The Man Who 
Loved Only Numbers more socially oriented; it even contains a detailed story about 
Erdos’ intuition and Marilyn vos Savant’s column on the Monty hall dilemma. It 
also has sub-biographies of several mathematicians who make more than cameo 
appearances. The different flavors of the two books are illustrated by the types of 
errors they contain. For instance, The Man Who Loved Only Numbers cites an 
erroneous value of a (pages 92 and 209), while My Brain Is Open confuses Ultra 
(a complex Allied operation involving intercepting and rendering intelligible enci- 
phered enemy signals) with the codes used by Nazi Enigma machines. I suspect 
that life in Hungary between the World Wars was far worse than these books 
portray. 

My Brain Is Open contains a broad spectrum of interesting topics: the many 
travels of the wandering mathematician; his early life and friendships in Hungary; 
mathematical proof (including THE BOOK where elegant proofs are found); 
Squaring squares and Nazi Enigma machines; Ramsey theory; life at the Institute 
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for Advanced Study (including discussions with Einstein, G6del, and von 
Neumann); number sieves; the great Prime Number Theorem controversy between 
Erd6és and Selberg; and his many collaborations. The Man Who Loved Only 
Numbers also contains a rich vein of interesting subjects: straight from THE 
BOOK; problems with Sam and Joe; Einstein vs. Dostoevsky; marginal revenge; 
S.F. made the integers; and survivor’s party. Each of the books has a splendid set 
of photographs of Erdés at various points in Jhis life (only one photograph is 
common to the two books). Further, The Man Who Loved Only Numbers contains 
sample letters showing Erdés’ noteworthy handwriting and terse style. Each of the 
bibliographies documents numerous technical results very well and points to useful 
sources for much additional information about Erdés. 

In a world where the only modern mathematician that an average person can 
name is Theodore Kaczynski, it is wonderful to have these two accounts of the 
achievements and the humanity of the beloved Paul Erdos. I highly recommend 
The Man Who Loved Only Numbers to the intelligent general reader interested in 
the life and results of a mathematical genius, and I highly recommend My Brain Is 
Open to mathematicians, physicists, and computer scientists. For those saddened 
now that Erdos has, as he would have put it, “left,” have heart, for now he knows 
whether the Riemann Hypothesis is true or not! And maybe the proof doesn’t 
require any complex function theory. 


506 Seaborn Drive NW, Huntsville, AL 35806 — 1831 
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TELEGRAPHIC REVIEWS 


Edited by Arnold Ostebee 


with the assistance of the Mathematics Departments of 
Carleton, Macalester, and St. Olaf Colleges 


Telegraphic Reviews are designed to alert readers in a timely manner to new books 
appropriate to mathematics teaching and research. Special codes classify reviews by 
subject area and appropriate use: 

T : Textbook P: Professional Reading 1~4: Semester 

C : Computer Software L: Undergraduate Library = ** : Special Emphasis 

S : Supplementary Reading 13: Grade Level ?? : Questionable 
Readers are advised that price information is subject to change. Selected books 


receive a second, more extensive review in the Monthly. 


Books submitted for review should be sent to Book Reviews Editor, American Mathe- 
matical Monthly, St. Olaf College, 1520 St. Olaf Avenue, Northfield, MN 55057-1098. 


Reference, P. Handbook of Numerical Analy- 
sis, Volume VI. Eds: P.G. Ciarlet, J.L. Lions. El- 
sevier Science, 1998, x + 689 pp, $164. [ISBN 
0-444-82569-X] Three articles in two sec- 
tions: “Numerical Methods for Solids (Part 3)” 
and “‘Numerical Methods for Fluids (Part 1).” 


Mathematics Appreciation, S, L. Strength in 
Numbers. Sherman K. Stein. Wiley, 1996, xiii 
+ 272 pp, $16.95 (P). [ISBN 0-471-32974- 
6] Subtitle (Discovering the Joy and Power of 
Mathematics in Everyday Life) is apt. Bits of 
real mathematics are described, explained, put 
in context, and admired. Requires only high 
school background; less in most parts. Clear, 
friendly, sometimes elegant; even the polemics 
arefun. PZ 


Mathematics Appreciation, P, L. Drawbridge 
Up: Mathematics—A Cultural Anathema. Hans 
Magnus Enzensberger. Transl: Tom Artin. AK 
Peters, 1999, 48 pp, $5 (P). [ISBN 1-56881- 
099-7] A bilingual pamphlet (German origi- 
nal and English translation) containing the text 
of a talk given by poet and writer Enzensberger 
at the 1998 ICM meeting in Berlin. A lamen- 
tation over the “increasingly critical” paradox 
of the cultural isolation of mathematics at the 
peak of its “golden age” marked by “spectacu- 
lar” achievements and applications. LAS 


Recreational Mathematics, S, L*. The Math- 
emagician and Pied Puzzler: A Collection 
in Tribute to Martin Gardner. Eds: Elwyn 
Berlekamp, Tom Rodgers. AK Peters, 1999, 
x + 266 pp, $34 (P). [ISBN 1-56881-075-X] 
Proceedings of the first “gathering for Gard- 
ner” held in 1993. Games, puzzles, and recre- 
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ational mathematics—all accessible to general 
readers—contributed by the world’s foremost 
magicians, puzzlists, and mathematicians. LCL 


History, S(14-17), P, L**. Euler: The Master 
of Us All. William Dunham. Dolciana Math. 
Expos., No. 22. MAA, 1999, xxviii + 185 pp, 
$29.95 (P). [ISBN 0-88385-328-0] A neces- 
sarily sparse sample of Euler’s major contribu- 
tions to diverse areas of mathematics (number 
theory, infinite series, complex variables, alge- 
bra, geometry, and combinatorics) conveyed in 
contemporary notation yet faithful to Euler’s 
approach—even his occasional “mathematical 
madness.” Each chapter sets the stage by de- 
scribing clearly the efforts that preceded Euler, 
the magnitude of the challenge he faced, and 
the impact of the contribution he made. LAS 


History, P. Selected Publications of Eugene 
L. Lawler. Eds: K. Aardal, et al. CWI Tract, 
V. 126. Centrum voor Wiskunde en Informat- 
ica, 1999, x + 318 pp, Dfl. 60 (P). [ISBN 90- 
6196-484-9] 26 of Lawler’s technical and ex- 
pository papers as well as a complete list of his 
publications. 


Foundations, T(14), L. An Introduction to Ab- 
stract Mathematics. Robert J. Bond, William 
J. Keane. Brooks/Cole, 1999, xix + 323 pp. 
[ISBN 0-534-95950-7] Based on a “transi- 
tions” course at Boston College. Chapters 1-5 
introduce logic, sets, functions, relations, and 
the integers. Chapters 6-8 give applications to 
infinite sets, real and complex numbers, poly- 
nomials. Year of calculus assumed. Many ex- 
amples and exercises. JD 


Foundations, S(13-18), P, L*. A Mathemat- 
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ical Mystery Tour: Discovering the Truth and 
Beauty of the Cosmos. A.K. Dewdney. Wiley, 
1999, vi+ 218 pp, $22.95. [ISBN 0-471-23847- 
3]. An imaginative, meandering tour of math- 
ematics in the service of resolving two abiding 
mysteries: mathematics’ “unreasonable’’ util- 
ity, and whether it is discovered or created. 
Dewdney suggests that perhaps the Pythagore- 
ans were right after all—that mathematics ex- 
ists, awaiting discovery, in an other-worldly 
‘holos” out of which the cosmos comes into 
being. LAS 


Discrete Mathematics, T(13-15: 1, 2). Dis- 
crete Mathematics and Its Applications, Fourth 
Edition. Kenneth H. Rosen. McGraw-Hill, 
1999, xxii + 804 pp. [ISBN 0-07-289905-0] 
New section on generating functions; more on 
rules of inference; ties to new web site. (Second 
Edition, TR, November 1991.) DB 


Number Theory, T(14-16: 1, 2). Elements 
of the Theory of Numbers. Joseph B. Dence, 
Thomas P. Dence. Academic Pr, 1999, xvii 
+ 517 pp. [ISBN 0-12-209130-2] Well-done 
but fairly standard introduction to number the- 
ory. Includes introduction to number fields and 
partition theory. DB 


Number Theory, T(13-15: 1), L. The Math- 
ematics of Ciphers: Number Theory and RSA 
Cryptography. S.C. Coutinho. AK Peters, 
1999, xv + 196 pp, $30. [ISBN 1-56881-082- 
2] A gentle amd very readable introduction 
to number theory that culminates in the RSA 
public-key cryptosystem. DB 


Linear Algebra, T(14). Introduction to Lin- 
ear Algebra. Donald J. Wright. McGraw- 
Hill, 1999, ix + 392 pp. [ISBN 0-07-072098-3] 
From the Preface: “The goal is to learn to think 
in terms of linear algebra notions ... and that 
sort of familiarity comes from using the ideas in 
a substantive way.” Contains a variety of useful 
examples. PF 


Ring Theory, P, L. Rings and Things and a 
Fine Array of Twentieth Century Associative Al- 
gebra. Carl Faith. Math. Surv. & Mono., V. 65. 
AMS, 1999, xxxii + 422 pp, $99. [ISBN 0- 
8218-0993-8] Exhaustive survey of 125 years 
of associative algebras, ring and module the- 
ory. The author’s Algebra I and II serve as the 
foundation for this survey. Also includes the 
author’s thoughts on mathematics and mathe- 
maticians of the last 50 years. Bibliography has 
over 1600 references. JD 


Algebra, T(17: 2), P, L. Post-Modern Algebra. 
Jonathan D.H. Smith, Anna B. Romanowska. 
Pure & Appl. Math. Wiley, 1999, xi + 370 pp, 
$69.95. [ISBN 0-471-12738-8] Introduction 
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to algebra from an applications-based perspec- 
tive. Traditional topics of groups, rings, fields, 
modules are accompanied by monoids, quasi- 
groups, lattices, Boolean algebras, and more. 
Structures unified by techniques of universal 
algebra and category theory. JD 


Algebra, P. Differential and Difference Di- 
mension Polynomials. M.V. Kondratieva, et al. 
Math. & Its Applic., V. 461. Kluwer Academic, 
1999, xiii + 422 pp. [ISBN 0-7923-5484-2] 


Real Analysis, T*(16-17: 3), L. A Course in 
Real Analysis. John N. McDonald, Neil A. 
Weiss. Academic Pr, 1999, xvii + 745 pp. 
[ISBN 0-12-742830-5] Each chapter begins 
with a biography of a key contributor (from 
Cantor to Daubechies). Discusses the usual top- 
ics: set theory, real number system, Lebesgue 
theory, metric spaces. Further interest is piqued 
by chapters on probability, harmonic analysis, 
dynamical systems. Lots of exercises. Teachers 
of analysis—have a look! KS 


Real Analysis, T(17), P. Real Analysis—With 
an Introduction to Wavelet Theory. Satoru 
Igari. Transl: Satoru Igari. Transl. of 
Math. Mono., V. 177. AMS, 1998, xiii + 
256 pp, $89. [ISBN 0-8218-0864-8] Cov- 
ers Lebesgue measure and integration, differ- 
entiation, abstract measures, L’-spaces, dis- 
tribution theory, Fourier analysis. Rather dry 
“definition-theorem-proof” exposition. Intro- 
duction to wavelets is very brief. BH 


Partial Differential Equations, T(16: 1), C, 
L. Partial Differential Equations and Bounda- 
ary Value Problems with Maple V. George A. 
Articolo. Academic Pr, 1998, xi1 + 628 pp, 
(P), with CD-ROM. [ISBN 0-12-064475-4] 
Traditional approach to PDEs incorporating a 
brief discussion of useful Maple commands 
and many problems solved using Maple. Be- 
gins with a review of ODEs and a chapter 
on Sturm—Liouville eigenvalue problems. In- 
cludes discussion of heat, wave, and Laplace 
equations. PG 


Partial Differential Equations, P. Differen- 
tial Operators and Spectral Theory: M. Sh. 
Birman’s 70th Anniversary Collection. Eds: 
V. Buslaev, M. Solomyak, D. Yafaev. AMS 
Transl., Ser. 2, V. 189. AMS, 1999, viii + 
285 pp, $99. [ISBN 0-8218-1387-0] 


Functional Analysis, P. /ntroduction to the 
Theory and Applications of Functional Dif- 
ferential Equations. V. Kolmanovsku, A. 
Myshkis. Math. & Its Applic., V. 463. Kluwer 
Academic, 1999, xvi + 648 pp, $295. [ISBN 
0-7923-5504-0] 


Analysis, P. Partial Differential and Inte- 
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gral Equations. Eds: Heinrich G.W. Begehr, 
Robert P. Gilbert, Guo-Chun Wen. _ Intern. 
Soc. for Analy., Applic. & Computat., V. 2. 
Kluwer Academic, 1999, x + 369 pp, $168. 
[ISBN 0-7923-5482-6] Proceedings of a 1997 
congress. Topics: function theoretic and func- 
tional analytic methods for PDEs; applications 
of function theory of several complex variables 
to PDEs; integral equations and boundary value 
problems; PDEs. 


Algebraic Geometry, P. Mirror Symmetry and 
Algebraic Geometry. David A. Cox, Sheldon 
Katz. Math. Surveys & Mono., V. 68. AMS, 
1999, xxi+469 pp, $69. [ISBN 0-8218-1059-6] 
Supersymmetric string theories exhibit interest- 
ing properties yet have shaky foundations, un- 
verified by experiment. Exposits the algebraic 
geometry likely needed for a firm mathemati- 
cal foundation. “Mirror pairs” of Calabi—-Yau 
manifolds play a central role. RM 


Algebraic Geometry, P. The Curves Semi- 
nar at Queen’s, Volume XII. Ed: Anthony V. 
Geramita. Pure & Appl. Math., V. 114. Queen’s 
Univ, 1998, 162 pp, (P). [ISBN 0-8891 1-832- 
9] Includes an expository paper on “Compu- 
tational Invariant Theory” by Gregor Kemper 
as well as five contributed notes. 


Differential Geometry, T?(17-18: 1), P. 
Meromorphic Functions and Projective Curves. 
Kichoon Yang. Math. & Its Applic., V. 464. 
Kluwer Academic, 1999, viii + 201 pp, 
$105. [ISBN 0-7923-5505-9] Concise, self- 
contained introduction and exposition of mero- 
morphic functions of algebraic curves from a 
geometric viewpoint. Topics include Brill- 
Noether theory, projective differential geome- 
try, minimal surfaces in Kahler manifolds. RM 
Topology, T(17: 2), P, L. Aspects of Topol- 
ogy, Second Edition. Charles O. Christen- 
son, William L. Voxman. BCS Associates, 
1998, x + 493 pp, $48 (P); $75. [ISBN 0- 
914351-08-7; 0-914351-07-9] Topics include 
topolozical spaces, continua, homotopy theory, 
n-manifolds, and dimension theory. Changes 
from First Edition (TR, April 1978): rewrites 
on CW-complexes and covering spaces, vari- 
ous clarifications. JD 


Operations Research, T(17-18: 1), P. Ge- 
ometric Methods and Optimization Problems. 
V. Boltyanski, H. Martini, V. Soltan. Comb. 
Optim., V. 4. Kluwer Academic, 1999, viii 
+ 429 pp, $204. [ISBN 0-7923-5454-0] Ap- 
plies methods from convex geometry to solve 
optimization problems in control theory (mod- 
ern variational geometry), location science, and 
computational geometry. Many intuitive dia- 
grams motivate the ideas. RM 
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Optimization, T*(15-16: 1). Practical Ge- 
netic Algorithms. Randy L. Haupt, Sue 
Ellen Haupt. Wiley, 1998, xiv + 177 pp, 
$44.95. [ISBN 0-471-18873-5] Genetic algo- 
rithms use the model of evolutionary selection 
to solve a wide variety of optimization prob- 
lems. Text starts with the basics of optimization 


‘and then shows how both discrete and contin- 


uous problems can be attacked using genetic 
algorithms. Many applications, some pseu- 
docode, good references. No exercises. All 
in all, an interesting book. MPR 


Optimization, P. Minimax Theorems and 
Qualitative Properties of the Solutions of Hemi- 
variational Inequalities. D. Motreanu, P.D. 
Panagiotopoulos. Nonconvex Optim. & Its Ap- 
plic., V. 29. Kluwer Academic, 1999, xviii + 
309 pp, $156. [ISBN 0-7923-5456-7] 


Optimization, P. Reformulation: Nonsmooth, 
Piecewise Smooth, Semismooth and Smoothing 
Methods. Eds: Masao Fukushima, Liqun Qi. 
Appl. Optim., V. 22. Kluwer Academic, 1999, 
viii + 441 pp, $194. [ISBN 0-7923-5320-X] 
22 refereed papers; most based on talks given 
at the 16th International Symposium on Mathe- 
matical Programming held at Lausanne, EPFL, 
Switzerland, in 1997. 


Elementary Statistics, T(17: 1), C. Data, 
Statistics, and Decision Models with Excel. 
Donald L. Harnett, James F. Horrell. Wi- 
ley, 1998, xviii + 605 pp, $93.95, with disks. 
[ISBN 0-471-13398-1] Conceptual modeling 
and statistics text for MBA students. Empha- 
sizes problem solving and decision making; cal- 
culus not required. Integrates use of Excel. 
Disks contain data and an Excel add-in. HS 


Elementary Statistics, T(14: 2). Introduction 
to Probability and Statistics for Scientists and 
Engineers. Walter A. Rosenkrantz. Ser. in 
Prob. & Stat. McGraw-Hill, 1997, xiv + 
592 pp. [ISBN 0-07-053988-X] Covers ba- 
sic probability theory, sampling distributions, 
estimation, inference, linear and multiple re- 
gression, ANOVA, block designs, and control 
charts. MPR : 


Mathematical Statistics, T(15: 1). Mathemat- 
ical Statistics: An Introduction. Wiebe R. Pest- 
man. Walter de Gruyter, 1998, ix + 545 pp, 
$79 (P). [ISBN 3-11-015356-4] Theoreti- 
cally rigorous introduction; includes a chapter 
on probability theory. Assumes knowledge of 
calculus and linear algebra. Companion vol- 
ume (see following review) contains solutions 
to all 260 exercises. HS 


Mathematical Statistics, S. Mathematical 
Statistics: Problems and Detailed Solutions. 
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Wiebe R. Pestman, Ivo B. Alberink. Walter 
de Gruyter, 1998, ix + 325 pp, $79 (P). [ISBN 
3-11-015358-0] Detailed solutions to all 260 
exercises in Pestman’s Mathematical Statistics: 
An Introduction (see previous review). HS 


Statistical Methods, T(18:2), P. Statistical 
Learning Theory. Wladimir N. Vapnik. Wi- 
ley, 1998, xxiv + 736 pp, $105. [ISBN 0-471- 
03003-1] Statistical learning theory explores 
ways of estimating functional dependency from 
a given collection of data for small samples 
without a priori knowledge about the prob- 
lem to be solved. This comprehensive study 
of learning processes contains: general quali- 
tative theory including necessary and sufficient 
conditions for consistency; general quantitative 
theory including bounds on the rate of conver- 
gence; principles for estimating functions from 
small data sets; methods of function estimation; 
applications to real-life problems. KB 


Statistical Methods, T(16—-17: 1), P. Sam- 
pling of Populations: Methods and Applica- 
tions, Third Edition. Paul S. Levy, Stanley 
Lemeshow. Ser. in Prob. & Stat. Wiley, 1999, 
XXx1 + 525 pp, $89.95. [ISBN 0-471-15575-6] 
Refines Second Edition’s treatment of telephone 
sampling and interviewing methodology. Ma- 
terial on survey data analysis now discusses use 
of appropriate software. HS 


Statistical Methods, P. The Analysis of Vari- 
ance. Henry Scheffé. Wiley Classics Library. 
Wiley, 1999, xvi + 477 pp, $49.95 (P). [ISBN 0- 
471-34505-9] Paperback printing of text orig- 
inally published in 1959. 


Statistics, C. JMP Start Statistics: A Guide 
to Statistics and Data Analysis Using JMP and 
JMP IN Software. John Sall, Ann Lehman. 
Duxbury Pr (Wadsworth), 1996, xxii + 521 pp, 
$61.95 (P), with disks. [ISBN 0-534-26565-0] 
Detailed introduction to JMP and JMP IN. Step- 
by-step instructions on how to perform many 
common statistical tests and procedures. Very 
thorough; many screen shots. MPR 


Statistics, P. Entropy Methods in Statistical Es- 
timation. M.H. Wegkamp. CWI Tract, V. 125. 
Centrum voor Wiskunde en Informatica, 1998, 
120 pp, Dfl. 35 (P). [ISBN 90-6196-483-0] 


Mathematical Computing, T(15-16), S. Ad- 
vanced Engineering Mathematics with Mathe- 
matica and MATLAB, Volume I. Reza Malek- 
Madani. Addison-Wesley, 1998, xiv + 558 pp, 
(P). [ISBN 0-201-59881-7] After brief intro- 
ductions to Mathematica and MATLAB, covers 
ODEs, transform methods, linear algebra, sys- 
tems of DEs, and numerical methods. Nice in- 
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troduction to the use of these packages. Plenty 
of exercises. MPR 


Computer Science, T(18), P. Queueing Net- 
works and Markov Chains: Modeling and Per- 
formance Evaluation with Computer Science 
Applications. Gunter Bolch, et al. Wiley, 1998, 
xvi + 726 pp, $89.95. [ISBN 0-471-19366-6]. 
Presents the theory of computer performance 
analysis from the perspective of queueing the- 
ory and Markov chains. Applications include 
client-server systems, polling systems, operat- 
ing systems, ATM networks. MPR 


Applications (Biological Science), P. Mathe- 
matical and Computational Biology: Computa- 
tional Morphogenesis, Hierarchical Complex- 
ity, and Digital Evolution. Ed: Chrystopher 
L. Nehaniv. Lect. on Math. in the Life Sci., 
V. 26. AMS, 1999, xi +201 pp, $59 (P). [ISBN 
0-8218-0941-5] Proceedings of a 1997 work- 
shop at the University of Aizu, Japan. 
Applications (Physics), T(17: 2), P. Geom- 
etry, Particles, and Fields. Bjg@rn Felsager. 
Grad. Texts in Contemp. Physics. Springer- 
Verlag, 1998, x + 672 pp, $69.95. [ISBN 0- 
387-98267-1] Geometric approach to modern 
particle field theory. First half is a relatively 
self-contained introduction to field theory, with 
an emphasis on gauge theory and nonlinear field 
theory. Second half is an introduction to dif- 
ferential geometry and its application to field 
theory. MPR 


Applications (Statistical Mechanics), T*(16— 
17: 2), P*. A Modern Course in Statistical 
Physics, Second Edition. L.E. Reichl. Wiley, 
1998, xix + 822 pp, $84.95. [ISBN 0-471- 
59520-9] Thorough, self-contained introduc- 
tion to statistical mechanics using thermody- 
namics and probability theory as a foundation. 
Especially interesting for its extended coverage 
of non-equilibrium processes. Excellent exam- 
ples and exercises. An outstanding choice for 
either a text or a reference. MPR 


Applications, P. Traffic Control and Trans- 
port Planning: A Fuzzy Sets and Neural Net- 
works Approach. DuSan Teodorovié¢, Katarina 
Vukadinovié. Intern. Ser. in Intelligent Tech. 
Kluwer Academic, 1998, xviii + 387 pp, $150. 
[ISBN 0-7923-8380-X] 


Reviewers 


KB: Karla Ballman, Macalester; DB: David Bressoud, 
Macalester; JD: Jill Dietz, St. Olaf; PF: Paul Froeschl, 
Macalester; PG: Philip Gloor, St. Olaf; BH: Bruce Han- 
son, St. Olaf; LCL: Loren C. Larson, St. Olaf; RM: Richard 
Molnar, Macalester; AO: Amold Ostebee, St. Olaf; MPR: 
Matthew P. Richey, St. Olaf; KS: Karen Saxe, Macalester; 
HS: Heidi Shierholz, St. Olaf; LAS: Lynn Arthur Steen, 
St. Olaf; PZ: Paul Zorn, St. Olaf. 
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LESTER R. FORD AWARDS FOR 1998 


The Lester R. Ford Awards, established in 1964, are made annually to authors of 
outstanding expository papers in the MONTHLY. The awards are named for Lester 
R. Ford, Sr., a distinguished mathematician, editor of the MONTHLY (1942-46), 
and President of the Mathematical Association of America (1947-1948). 

Winners of the Lester R. Ford Awards for expository papers appearing in 
Volume 105 (1998) of the MONTHLY are: 


Yoav Benyamini, Technion—Israel Institute of Technology, Applications of the 
Universal Surjectivity of the Cantor Set, pp. 832-839 


A classical theorem due to Alexandroff and Hausdorff states that every compact 
metric space is the continuous image of the Cantor set. In this paper Yoav 
Benyamini presents striking applications of this result to diverse areas of mathe- 
matics. Each of these applications involves an existence theorem that Benyamini 
shows us how to prove using the universal surjectivity of the Cantor set. Some of 
these results are well known, such as the existence of space-filling curves and the 
isometric identification of every separable Banach space with a subspace of 
C([0,1]). Other results are more unusual, such as the existence of a compact convex 
subset of R”*? whose faces include congruent copies of all compact convex subsets 
of the unit cube in R”. Other results are even more counter-intuitive, such as the 
existence of a continuous real-valued function f on R with the property that for 
every bounded sequence (a,,) of real numbers, there exists f € Rwith f(t + n) =a, 
for all m. Benyamini ties all these results together in a pretty package with the 
common theme that the Cantor set and its universal surjectivity lurk behind many 
strange phenomena. 


Jerry L. Kazdan, University of Pennsylvania, Solving Equations, an Elegant Legacy, 
pp. 1-21 


The paper discusses various types of equations: polynomial equations in one 
and several variables, linear and nonlinear differential equations, diophantine 
equations, and congruences. The overriding idea is that familiar procedures for 
solving equations, often viewed as “tricks”, can be seen as belonging to broad 
themes that, in turn, yield new insights on equations. Among the themes are: 
exploiting symmetry, finding a related problem, understanding the family of all 
solutions, finding obstructions when an equation has no solution, using variational 
methods, and reformulating a problem. An extensive discussion of symmetry is an 
important unifying thread. It bears on complex conjugation, linear differential 
equations, Markov chains, Lie’s “Galois” theory of differential equations, and 
Pell’s equation. Kazdan’s article is an instructive and wide-ranging tour of the © 
mathematician’s workshop in important classes of equations. 
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Bernd Sturmfels, University of California, Berkeley, Polynomial Equations and 
Convex Polytopes, pp. 907-922 


How many complex zeros can d polynomials in d variables have? In the case of 
the bivariate system 


a, +a,x + a,xy + ayy =b, + db, x*y + bjxy* = 0 


with nonzero real coefficients, Bezout’s theorem gives an upper bound of six 
solutions. But it has exactly four. To achieve this better estimate, we use an idea of 
Newton. To a bivariate polynomial )ix“y’, associate its Newton polygon, the convex 
hull of the vertices (u, v). The mixed area (P,Q) of two planar polygons P, Q is 
defined by 


M( P,Q) = area(P + Q) — area( P) — area(Q). 


In 1975, David Bernstein proved a general theorem that for two equations in two 
unknowns shows that the number of solutions of a system of two bivariate 
polynomial equations is equal to the mixed area of the two corresponding Newton 
polygons. Sturmfels outlines an algorithmic proof devised by B. Huber and himself 
in 1995 that leads to a numerical approximation for the solution. The author deftly 
avoids getting overwhelmed by algebraic and geometric detail by using examples 
and organizing his account of the proof of Bernstein’s theorem around three key 
steps. The case of real zeros has been seriously investigated for only the past 
twenty years and little is known. Sturmfels brings us into the cut and thrust of 
current research with its distance between conjecture and reality and its open 
questions, leaving much to do that is of interest to combinatorialists, algebraic 
geometers, and applied mathematicians. 
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Introductory Lectures on Rings 
and Modules 


J. Beachy 


A highly accessible introduction to the study 
of the noncommutative aspects of rings and 
modules. Ideal as a first-year graduate text or 
| as a reference for advanced undergraduates. 


London Mathematical Society Student Texts 47 
1999 246 pp. 
0-521-64340-6 
0-521-64407-0 


Hardback 
Paperback 


$64.95 
$24.95 


Geometry 


David A. Brannan, Matthew F. Esplen, 
and Jeremy J. Gray 


A richly illustrated exploration of various 
geometries: affine, projective, inversive, 
non-Euclidean and spherical. Includes full 
solutions to over 200 problems. 


1999 510 pp. 
0-521-59193-7 
0-521-59787-0 


Hardback 
Paperback 


$74.95 
$29.95 


Mathematical Explorations 
with MATLAB 


Ke Chen, Peter J. Giblin, and 
Alan Irving 


Surveys the mathematics most frequently encoun- 
i tered in first-year university courses through the 
use of MATLAB, a powerful software package. 
All extras to the standard MATLAB package are 
supplied on the World Wide Web. 


1999 320 pp. 
0-521-63078-9 
0-521-63920-4 


Hardback 
Paperback 


$64.95 
$24.95 


The Mathematica® Book, 
Version 4 

Fourth Edition 

Stephen Wolfram 


From the reviews of the previous edition: 


“Perhaps the finest book ever assembled on a 
software package.” Amazon.com 


Copublished with Wolfram Media 


1999 1496 pp. 


0-521-64314-7 Hardback $49.95 


An Introduction to 


Mathematical Finance 
Options and Other Topics 


Sheldon M. Ross 


A mathematically elementary entrée to options 
pricing theory. The text requires no prior 
knowledge of probability and covers all the 
necessary preliminaries simply and clearly. 
1999 224 pp. 
0-521-77043-2 


Hardback $34.95 


Elementary Number Theory 
in Nine Chapters 
James J. Tattersall 


A superb text for a one-semester introductory course 
in number theory. Tattersall adopts a historical per- 
spective and emphasizes the subject’s applied aspects, 


especially cryptography. 


1999 c.413 pp. 
0-521-58503-1 Hardback $74.95 
0-521-58531-7 Paperback $34.95 


Fourier Analysis on Finite 
Groups and Applications 
Audrey Terras 


A gentle introduction to Fourier analysis on 

finite groups. Terras presents a concrete approach 
to abstract group theory through applied examples, 
pictures and computer experiments. 


London Mathematical Society Student Texts 43 


1999 452 pp. 
0-521-45108-6 
0-521-45718-1 


Hardback 
Paperback 


$74.95 
$29.95 


Contemporary Issues in 
Mathematics Education 

Estela A. Gavosto, Steven G. Krantz, 
and William McCallum, Editors 


Presents a serious discussion of current educational 
issues, with a balanced representation of opposing 
ideas. 


Mathematical Sciences Research Institute 
Publications 36 


1999 184 pp. 
0-521-65255-3 Hardback $54.95 
0-521-65471-8 Paperback $19.95 


Available in bookstores or from 


CAMBRIDGE 


UNIVERSITY PRESS 


40 West 20th Street, New York, NY 10011-4211 
Call toll-free 800-872-7423 
Web site: http://www.cup.org 


MasterCard/VISA accepted. Prices subject to change. 


AMERICAN MATHEMATICAL SOCIETY 


The AMS invites authors to submit manuscripts to be considered for publication in the Student 


Mathematical Library, a new series of undergraduate studies in mathematics. Books to be 
published in the series should be suitable for honors courses, upper-division seminars, reading 
courses, or self-study. 


The following perspectives may serve as guidelines: 

*Continuations from standard undergraduate courses: Topics may include coding theory, following 
on from number theory and/or algebra, Fourier series from analysis or ODEs, and elementary 
PDEs from analysis and ODEs, or others. 


«Introducing students to topics covered in graduate school: Appropriate subjects may include for 
example, introductory differential geometry, minimal surfaces, introductory algebraic geometry, 
topics in representation theory, complex analysis, or probability. 


*Covering topics outside the standard undergraduate curriculum: Such topics may include game 
theory, mathematical physics, mathematics of finance, mathematical biology, or others. 


Manuscripts intended for submission should ideally contain problems either within the body 

of the text or at the end of each chapter or section. Connections to current research are 
encouraged. This could take the form of reports on recent results and/or lists of open problems 
of continuing interest. 


For more information contact: 

Sergei Gelfand, Director of Acquisitions (sxg@ams.org) or Edward Dunne, Editor for 

the Book Program (egd@ams.org), at the American Mathematical Society, P O. Box 6248, 
Providence, RI 02940-6248, U.S.A.; telephone |-800-321-4267 (U.S. and Canada) or |-401-455- 
4000 (worldwide); fax 1-401-331-3842. 


Titles currently published in this series ... 


An Introduction to the Mathematical Miles of Tiles 

Theory of Waves Charles Radin, University of Texas, Austin 

Roger A. Knobel, University of Texas-Pan American, Volume 1; 1999; ISBN 0-8218-1933-X; 120 pages; Softcover; 

Edinburg All AMS members $13, List $16, Order Code STML/1MM98 

Volume 3; 2000; ISBN 0-8218-2039-7; approximately 200 . 

pages; Softcover; All AMS members $18, List $23, 

Order Code STML/3MM98 
Prime Numbers and Their 

Lectures on Contemporary Distribution 

Probability Gérald Tenenbaum, Université Henri Poincaré, Nancy 

Gregory F. Lawler, Duke University, Durham, NC, and |, France, and Michel Mendes France, Universite 

Lester N. Coyle, Loyola College, Baltimore, MD Bordeaux I, France 

Volume 2; 1999; ISBN 0-8218-2029-X; 99 pages; Softcover; 2000; ISBN 0-8218-1647-0, approximately 120 pages; 

All AMS members $14, List $17, Order Code STML/2MM98 _—«S$oftcover; All AMS members $14, List $17, Order Code 

STML-TENENBAUMM98 


To purchase any of these titles: 
Order from: American Mathematical Society, P.O. Box 5904, Boston, MA 02206-5904, USA.AIl prices subject to 
change. Charges for delivery are $3.00 per order. For optional air delivery outside of the continental U. S., please 
include $6.50 per item. Prepayment required. For credit card orders, fax 1-401-455-4046 or call toll free |-800-32- 
4AMS (4267) in the U.S. and Canada, |-401-455-4000 worldwide. Or place your order through the AMS bookstore at - 
www.ams.org/bookstore/. Residents of Canada, please include 7% GST. 
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KURT KREITH and G. DONALD CHAKERIAN, both, 
University of California at Davis 


ITERATIVE ALGEBRA AND 
DYNAMIC MODELING 


A Curriculum for the Third Millennium 


poe pat iedth Iterative Algebra and 
2 OSEAN Dynamic Modeling 
PEs a a links together the use 
. apes of technology (Excel®, 
nn AND . . | Stella®) and modern 
DYNAMIC: | mathematical techniques 
MODELING | to explore the interaction 
«>.< | of algebra (at the precalcu- 
ACuiiciun torte | Tus level) with computer 
————-—"_ and graphing calculator 
technology. The book will find use in a variety 
of college courses, and also in enrichment 
courses at the high school level. 


1999/331 PP., 192 ILLUS./HARDCOVER/ $49.95 
ISBN 0-387-98758-4 
TEXTBOOKS IN MATHEMATICAL SCIENCES 


MICHAEL W. FRAZIER, Michigan State University 


AN INTRODUCTION TO WAVELETS 
THROUGH LINEAR ALGEBRA 


This purpose of this text is to bring together 
different topics covered in the undergraduate 
curriculum and introduce students to current 
developments in mathematics and their applica- 
tions. The only prerequisites assumed are a 
basic linear algebra background and a bit of 
analysis background. 


1999/516 PP., 46 ILLUS./HARDCOVER/ $49.95 
ISBN 0-387-98639-1 
UNDERGRADUATE TEXTS IN MATHEMATICS 


JAMES J. CALLAHAN, Smith College, Northampton, MA 


THE GEOMETRY OF SPACETIME 


An Introduction to Special and General Relativity 


In The Geometry of Spacetime, James Callahan 
explores the way an individual observer views 
the world and how a pair of observers collaborate 
to gain objective knowledge of the world. To 
encompass both the general and special theory, 
Callahan uses the geometry of spacetime as the 
unifying theme of the book. 


1999/APP. 456 PP., 217 ILLUS./HARDCOVER/ $49.95 
ISBN 0-387-98641-3 
UNDERGRADUATE TEXTS IN MATHEMATICS 


Springer 


www.springerny.com 


MATHEMATICS 


SABER N. ELAYDI, Trinity University, San Antonio, iy 
AN INTRODUCTION TO 
DIFFERENCE EQUATIONS 


Second Edition 


This book integrates both classical and modern 
treatments of difference equations. It contains 
the most updated and comprehensive material 
on stability, Z-transform, discrete control 
theory and asymptotic theory, continued 
fractions, and orthogonal polynomials. The 
presentation is simple enough for the book 

to be used by advanced undergraduate and 
beginning graduate students in mathematics, 
engineering science, and economics. 


1999/APP. 480 PP., 64 ILLUS./HARDCOVER/ $54.95 
ISBN 0-387-98830-0 
UNDERGRADUATE TEXTS IN MATHEMATICS 


SERGE LANG, Yale University, New Haven, CT 


MATH TALKS FOR 
UNDERGRADUATES 


For many years Serge Lang has given talks to 
undergraduates on selected items in mathemat- 
ics which could be extracted at a level under- 
standable by students who have had calculus. 
Written in a conversational tone, Lang now 
presents a collection of those talks as a book. 
The talks could be given by faculty, but even 
better, they may be given by students in seminars 
run by the students themselves. Topics covered 
include prime numbers, the abc conjecture, 
approximation theorems of analysis, Bruhat- 
Tits spaces, harmonic and symmetric 
polynomials, and more. 


1999/129 PP., 16 ILLUS./ SOFTCOVER/ $29.95 
ISBN 0-387-98749-5 


Co-published by Birkhduser and Springer 
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